RATIONAL METHODS IN MATRIX EQUATIONS! 
MARK H. INGRAHAM 


I. Introduction.’ I shall start with what I hope will not prove an 
overelaborate statement of the limitations of this paper in scope and 
treatment. I shall assume throughout a knowledge of the definitions 
of a field, a division algebra, of matrices and rational operations 
thereon. I shall also need to assume a knowledge of what is meant 
by the invariant factors and elementary divisors of a matrix. Little 
essential will be lost if the only field considered by the listeners is the 
rational number system and the only division algebra that of qua- 
ternions over the rational field. 

Consider a field and a system of constant matrices Ai, ---, Ay, 
unknown matrices X1,---, X, and equations 


(1) Ar X1,-°-, Xn) =0 


where the ¢,’s are polynomials with coefficients in &. If the elements 
of A; are aij, and of X; are x;i;x, (1) is equivalent to a system 


(2) =O 


where the y¥, are polynomials with coefficients in &. (If R is replaced 
by a division algebra } over &, the number of equations in (2) is merely 
enlarged.) We have therefore “reduced” the equations (1) to those 
of (2). This process we shall technically designate without great ex- 
aggeration as the “worst possible algorithm,” or, following modern 
style, W.P.A. This indicates that no “tour de force” which shows that 
ultimately a matrix problem can be solved in a finite time, but shows 
little else, is of interest. This is a topic in which the above simple 
proof of the existence of inelegant methods means that we need only 
pay attention to results that essentially use the matricial properties 
of matrices, only to results and methods having at least a minimum 
degree of elegance. 

Partly as a consequence of the above, this lecture, as is often the 
case, is not so much a description of broad theories of the nature we 
desire, as a report on what special cases have been found to be seduci- 


1 An address delivered before the Chicago meeting of the Society, April 14, 1939, 
by invitation of the Program Committee; presented in part April 15, 1939, under the 
title An algorithm for the solution of the unilateral matrix equation. 

2 The author wishes to thank H. C. Trimble, C. J. Everett, and J. H. Bell for help 
in preparing this paper. Their aid was made possible by the Research Committee of 
the University of Wisconsin. 
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ble. As a definition of the type of result I shall describe and the liter- 
ary style in which this description will be given, I will lay down, for 
this lecture, four canons. The first and last of these I would not defend 
as completely valid dicta. 

1. Irrational methods will be relegated to the background. In par- 
ticular, we will not assume that we can reduce every polynomial into 
linear factors. 

2. A theorem that is valid for the field of rational numbers is ipso 
facto of interest. This is particularly true if it extends to the case of 
all division algebras with finite basis over that field. (If a “field” had 
been defined to have characteristic zero and others to be “semi- 
fields,” we would, I believe, have had a more wholesome set of values 
than at present.) 

3. Proofs that are essentially algorithmic belong to the aristoc- 
racy. 

4. The language of matrices, bases, vectors, linear spaces, poly- 
nomials, greatest common divisors, and so on, will be used instead of 
operators, modules, ideals, lattices, and so on. This is in deference to 
(a) the non-algebraists, (b) the frequency with which generalizations 
avoided are merely formal, (c) my personal taste. 

The paper when presented as a symposium address included the 
discussion of the equation 7A =BT+C and a discussion of the uni- 
lateral matrix equation. Only the latter is included here since an ex- 
tended form of the former is being published elsewhere. (Ingraham [8].) 


II. The unilateral equation. We will now pass to the consideration 
of the unilateral (as to position of coefficients) equation 


(3) >> AX: = 0. 


We will endeavor to show that much more can be said of this equation 
than has heretofore been shown. However, at best the general treat- 
ment is not simple so that it is desirable to use when available simpler 
theories for special types of equations, or for yielding special types of 
solutions. 

What follows will be divided into three parts: 

1. The case where all the A; are polynomials in a single matrix A 
with scalar coefficients, and solutions X are sought which are of the 
same type. 

2. The case where (3) is of the type 


(4) ¥(X) =A 


where the polynomial y has scalar coefficients. 
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3. The general unilateral equation. 

The solution of case 1 was first given by Sziics* and though his 
methods are not rational, the rewriting in rational form, as here done, 
is not difficult. This type of solution had been given, however, for (4) 
by Roth,‘ and later by a method similar to that of Sziics by Franklin.5 

Consider 


(5) ¢(A, X) = 0 


and consider only such solutions X as are polynomials in A. We will 
also assume that the field & has characteristic zero. Let X(A) be a 
polynomial such that X(A) is a solution of (5). Hence ¢(A, X(A)) =0. 
A necessary and sufficient condition that this be true is that ¢(A, X (A)) 
be divisible by the first invariant factor of A, that is, the minimal 
polynomial h such that h(A) =0. Let g* be the highest power of an 
irreducible polynomial g contained in h as a factor. Hence if 
X(A)) =0, g* must divide X(A)). Since has characteristic 
zero this is equivalent to g dividing ¢, ¢’’,--- , (In case 
the characteristic is not zero the condition of divisibility is still work- 
able but not as neat.) 

Let X(A) =Xo(A)+Xi(A)g+X2(A)g*+ --- where the X; are re- 
duced mod g. 

Hence 


H(A, Xo) = 0 mod g, 


or in other words, ¢(A, X) has a zero in the field R[A]/{g(A)} for 
every irreducible factor g of h. 
Consider 


X) = ga(A, X) + ox(A, X)X’ 
= or(d, Xo) + ox(d, Xo)(Xo + Xig’) 
= (Xig’)ox(A, Xo) + vi mod g 
where y; is fixed by the determination of Xo and, in general, 
X) = Xo) + mod g 


where y, is determined in terms of Xo,--- , Xs-1. 
Note that since g is irreducible, g’ is prime to g. 
It is therefore necessary if k>1 that either 


(6) ox(A, Xo) Omod g 


3 Sziics [7]. 
* Roth [6]. 
6 Franklin [1]. 
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or that 
(7) ¥i = Omod g, #=1,---,k—-1. 


If for every irreducible factor g these conditions are satisfied, then 
we have for each g; an X,, such that X is a solution of ¢(A, X) =0 
if and only if 


X = X,, mod gi. 


That such an X exists follows from the Chinese Remainder Theorem 
for polynomials. Hence 


THEOREM 1. If & has characteristic 0, the equation ¢(A, X) =0 has a 
solution which is a polynomial in A if and only if for every irreducible 
factor g of the minimal polynomial of A there exist in R|d| solutions of 
o(A, X) =0 mod g which, in case g is a multiple factor of the minimal 
polynomial of A, satisfy (6), or (7). 


If the equation considered is of the form 6(X)=A where @ has 
scalar coefficients, then ’(A, Xo) = —1 and (6) above becomes neces- 
sary so that we can state 


THEOREM 2. If & has characteristic 0, the equation 0(X)=A has a 
solution which is a polynomial in A if and only if 6(X) =X mod g; has a 
solution in & [|], for every irreducible factor g; of the minimum function 
of A which, in case g; is a multiple factor of the minimal function of A, 
does not satisfy 0’(X) =0 mod gi. 


As a corollary of this, 


THEOREM 3. If R is algebraically closed and has characteristic 0, the 
equation X"=A has a solution X(A), a polynomial in A, if and only 
if 0 is not a multiple root of the minimum function of A. 


It is not difficult to compute the number of such functions. 

For certain cases the existence of solutions not polynomials in A 
is easily established by example. 

For instance 


has the solution 


© 6 6. 


| 
i 
x= | |: 
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It is interesting, therefore, to study the general solutions of equa- 
tions of the type discussed above. This is readily done for the equa- 
tions 0(X) =A, if A is nonderogatory, that is, its minimum equation 
is its characteristic equation. Since A is nonderogatory and every 
solution X is commutative with A, X is a polynomial in A. In this 
case our problem is completely solved. In case ¢ has scalar coeffi- 
cients, the general solution of 


= A 


is determined as follows.® 

If we find a matrix Y such that $(Y) is similar to A, then a non- 
singular matrix S may be found such that A = S¢( Y)S-'=¢(SYS) 
and hence SYS- is effective as X. Consider the invariant factors of 
A—NI, hi, he, --- , kx. Then @(Y) is similar to A if and only if the 
nullity of 4:(¢(Y)) equals the nullity of h;(A). Hence the invariant 
factors of Y must be divisors of 4;($(A)). Under this restriction and the 
fact that the nullity of any polynomial ¢ in Y is determined as the 
sum of the degrees of the greatest common divisors of the invariant 
factors of Y, with the polynomial ¢, diophantine equations may be 
written, any solution of which determines the invariant factors of a 
solution X, and hence determines X to within a transformation by a 
matrix commutative with A. All solutions are the transforms by 
matrices commutative with A of a certain finite dissimilar system of 
solutions X1, X2,--- , X,. Those X; which are polynomials in A are 
those solutions which are dissimilar to all other solutions. For exam- 
ple if 


0 0 0) 
0 ol, 
00 0} 
then 

1/kO0 0) 

with 


This work generalizes with only moderate difficulty to the case 
where the scalars involved belong to a division algebra provided we 
ask the question correctly.? In the case where the scalars are in a 
field and ¢=)>_\‘a; then ¢(X) =A is equivalent to o(X)E=>_X ‘ta; 


6 Ingraham [2]. 
7 Ingraham [3]. 
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=Aé for every vector &. This is not the case if the a; belong to a 
noncommutative division algebra D. The correct question which can 
be answered is not to find a solution of ¢(X)=A, but given a set 
[a;] in D to find a matrix X such that >> X‘ta;=Aé for every &. This 
problem turns out to be equivalent to a system of equations 


o:(X) = A; 


where the coefficients of ¢; are in the centrum of the division algebra 
D. 


Let us now consider the general unilateral matrix equation 
(8) LAX! = 0, 


where the A; are m Xn matrices of constants. 

Fundamental to the consideration of (8) is the fact that if X is any 
constant matrix then A (A) =>>A iA‘ can be expressed in one and only 
one way as 


— X) + Bo 
and in fact 
Bo = 
Hence 


THEOREM 4. X is a solution of ).A;X‘=0 if and only if \I—X is a 
right factor of the matrix A(\) 


Theorem 5, which is a corollary of a theorem due to Phillips,® 
yields us an algorithm better than the W.P.A. This theorem is as 
follows: 


TueEoreM 5. If X is a solution of )A:X‘=0, then the determinant 
of the matrix \I —X divides the determinant of the matrix >A ii. 


This is, of course, a corollary of Theorem 4. The proof originally 
given was far more complicated than this proof which is due to 
C. C. MacDuffee. Phillips’ theorem yields a necessary condition on 
the invariant factors of any solution. If Y has such invariant factors 
and X = PYP-' is a solution, then >A iP Y‘=0. The problem is then 
reduced to finding all P’s satisfying this equation for canonical Y’s 
with possible invariant factors. 

Far stronger conditions can be secured from Theorem 6, which is a 
consequence of the following lemma. 


8 Phillips [5]. 
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Lemma. If A, B, C, are \-matrices such that A = BC, then the invari- 
ant factors of B (or C) divide the corresponding invariant factors of A. 


Thus 


THEOREM 6. If X is a solution of }A:X‘=0, then the invariant 
factors of the matrix }I—X are divisors of the corresponding invariant 
factors of the matrix d+. 


This eliminates many cases not strained out by the sieve of Phillips’ 
theorem. 

That these conditions or any conditions in terms of the invariant 
factors of A(X) are insufficient to guarantee the existence of solutions 
with a given set of invariant factors for \J—X is shown by the two 


0 


The A-matrices for these are respectively 


Gs) 


each of which has invariant factors \?, 1; but the first equation has no 
solution and the second has the solution 


0 

We now pass to the description of an algorithm believed to be new 
for the solution of (8). 

We shall call a matrix with elements polynomials in \ unimodular 
if its determinant is a scalar not zero, or what is equivalent, if it is 
nonsingular in the field of rational functions of X, and if its reciprocal 
has elements polynomials in i. 

We say that A is in triangular form if all the elements below the 
main diagonal are zero. We say it is in canonical triangular form if it 
is in triangular form and all the elements above the main diagonal are 
of lower degree than the elements in the same column on the main 
diagonal and if when a zero occurs on the main diagonal the whole 
row in which it occurs is zero. We also specify that the leading coeffi- 
cients of the polynomials along the main diagonal be 1. 


and 


| 
equations 
0 0 
|| 
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If U is unimodular, then AJ —X is a right factor of A if and only 
if it is a right factor of UA. Moreover, a unimodular matrix U may 
be found such that UA is in canonical triangular form, this form be- 
ing completely determined by® A. From now on we will assume that 
A is in such form. 


Let 
0 Ge2 23° Gen 
0 0 433° Gan 


where the a;;’s are polynomials in X. 
If A=H(AI—X) and if U is the unimodular matrix such that 
U(AI —X) is in canonical triangular form, then 


A = HU“U(AI — X) 


and T= U(AI—X) is a right factor of A. 

Our problem is therefore reduced to finding the triangular factors 
of A which are the canonical triangular forms of matrices of type 
\I—X where X is independent of X. 

We study then the problem of finding canonical triangular factors 
T of the matrix A and then of selecting those which are the canonical 
triangular forms of matrices of type AJ—X for some X. 

If A =ST where the matrices A and T are in triangular form, S is 
in triangular form, and 


i 
(9) = Dy $= = 0,---, 8 — 
k=0 


For the case j=0 these relations reduce to 
aig = 


and hence to the fact that the diagonal elements of T are factors of 
the corresponding diagonal elements of A. 

Equations (9) give linear congruences each conditioning (not nec- 
essarily uniquely) the possible ¢;; in terms of previously determined 
elements of T below ¢;; occurring in the jth column and elements of 
s to the left of s;; in the ith row—these may be considered therefore 


MacDuffee [4]. 
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as recursive formulae for the ¢;;. In these congruences the coefficient 
of t;; is s;; and the modulus is ¢;;, so that, in case & has infinitely many 
elements, an infinitude of solutions exists if and only if s;; and t;; are 
not relatively prime and at least one solution exists. 

It should be noted that if T is the triangular form of M, the product 
of the first 7 diagonal terms is the greatest common divisor of the 
determinants of the ith order minors of the first 7 columns of M. 
Hence if T is of the form U(AI—X) where U is unimodular, the sum 
of the degrees of the first 7 terms of the principal diagonal of T must 
be less than or equal toz and the sum of the degrees of all the diagonal 
terms is m, the order of T. I believe in most equations written at 
random these last conditions would preclude a solution. 

We turn now to determining X such that a T satisfying the above 
conditions is the triangular canonical form for some (AJ—X). It can 
be shown (but the proof is omitted) that X is unique if it exists. 

If T= U—(AI—X) where U is unimodular, then 


(10) U = (I — X)T-. 


If we choose a matrix X, (AI—X)T~- will have elements rational 
functions of \ but not, in general, polynomials. 

The problem therefore reduces to seeking X such that (AJ —X)T™ 
is a matrix with elements polynomials in X. 

Let 7-!= F/d where F is in triangular form and d is the determi- 
nant of T. The f;; will be polynomials. 

If (10) is fulfilled 


= (Afi; xinfe;)/d; 
hence u;; is a polynomial if and only if 
=> mod d. 


Since the f;; are of degree less than d, these linear congruences are 
easily reduced, by equating coefficients, to m systems of linear equa- 
tions for the x; (z fixed) all of the systems having the same matrix 
of coefficients, the rank of which must (in light of the uniqueness 
theorem) be 1 if a solution of all the systems exists. 

Since this paper was delivered at Chicago Mr. J. H. Bell has fur- 
nished me with the following note on the conditions upon T for a 
matrix X to exist such that T is a triangular form of \J—X. 

The existence of X depends directly upon the coefficients of the 
elements of T. 

We may obtain an Xn matrix D whose elements are scalars by 
the following steps. 


\ 
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1. Augment each element of T by the proper powers of \ with zero 
coefficients so that terms of the same degree as in the corresponding 
diagonal element appear. 

2. Break up each column into separate columns, each one of which 
involves only monomials of the same degree in \, for example, 


aAtd on, d 
3. Delete the columns which do not involve X. 
4. Set \=1 obtaining D. 
A necessary and sufficient condition that there exist a matrix X 
such that T= U-(AJ—X) is that | D| <0. 


If Tdi involving scalars only), then the above con- 
dition is equivalent to saying that m is the rank of 


M = (T,, Ty-1,°++ Ts, 


X may be obtained quite readily from D. We first find B =(0;,) as 
follows. If ¢;;41, (bj: - - - bjn) =(0--- 0, —1,0- -0)D-! where the 
— 1 occurs in the position corresponding to the column of D contain- 
ing ¢;,;; (where tind). If =1, (61 - - bin) where 
is a 1 Xm vector obtained by taking the /th row of the matrix ob- 
tained from the matrix in step 2 by deleting the columns involving 
the leading term of tix, for every k, and setting \=1. 

Then 

x == BT». 
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UNIVERSITY OF WISCONSIN 


MAXIMUM OF CERTAIN FUNDAMENTAL LAGRANGE 
INTERPOLATION POLYNOMIALS! 


M. S. WEBSTER 


This note extends some of the results obtained in a previous paper? 
which we shall designate as I. The notations are the same. 
We are concerned with the polynomials 


Gn (xx)(x — xx) 


where ¢,(x) =(x—%1)(x—42) - - - (x—x,) is the Jacobi polynomial of 
degree m which satisfies the differential equation (1—x?)¢,/’ (x) 
+ [a—B—(a+B)x (x) +n(n+a+fB—1)¢,(x)=0. The parameters 
a, B are positive and m is a positive integer. It is known that 
<x,<1. Throughout the paper, x is always re- 
stricted to the interval —1<x<1. 

It was shown in I, for example, that, if a=8=#%, max | 12 (x)| 
and as n> @. 

Now we use* 


+ B)T(n + a + B — 1) 
+ a+ — 1) 


and the asymptotic expressions‘ 


on(1) = 


\1/2-« 
¢,(cos 0) = (sin =) (cos =) 
(xn)'?T'(2n + a + B — 1) 2 
- {cos [V@ — (28 — 1)x/4] + 6)-'0(1)}, 
¢n(cos 0) = (sin (cos 
T'(2n + a+ B — 1) 2 


61/20 —3/2 
T'(n + 1) 


sin 


where N=n+(a+6—1)/2, c, € positive constants and 


1 Presented to the Society, April 13, 1940. 

2M. Webster, Note on certain Lagrange interpolation polynomials, this Bulletin, 
vol. 45 (1939), pp. 870-873. 

3 C. Winston, On mechanical quadratures formulae involving the classical orthogonal 
polynomials, Annals of Mathematics, (2), vol. 35 (1934), pp. 658-677. 

4 G. Szegé, Orthogonal Polynomials, American Mathematical Society Colloquium 
Publications, vol. 23, 1939, pp. 191-192, 121, 123. 
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where J,,(x) is Bessel’s function of order m. Since @,! (x; a, B) 
=nd,-1(x;a+1, 8+1), these yield immediately the following results: 


Lemma. For x, such that —1+¢Sx,S1—€ and |x—x,| 2e’>0, 
max | f(x) | +0 as n— © even if x>+1 (a, B<#; €, €’>0). 


THEOREM 1. For x; such that and |x—x,| 2€’>0, 
max | 1 (x)| = O(n) as n— © where max (a, B) = y >#; €,€’ >0. The ex- 
ponent y cannot be decreased. 


The method used in the proof of Theorem 5 in I really gives the 
following slightly stronger result: 


THEOREM 2. Jf —1+¢’Sx<1-€’, max | 12 (x) | 
—1 as n—o (e, €’>0). 


Combining Theorem 2 and the lemma, we obtain the following: 


THEOREM 3. For x; such that -1+€Sx,S1—€, max |/"(x)|—1 as 
n— © (a, B<H, e>0). 


This result is a considerable improvement over Theorem 5 in I. 
Moreover, if the hypothesis —1-++-«<x,<1—e is removed, the theo- 
rem is not true as Erdés and Griinwald® showed in case a=8=}3. In 
view of Theorems 1, 4, 5, 6, the restriction a, 8B << is also necessary. 
In particular, this theorem holds for the case of Tschebycheff 
(a=8=4) and Legendre (a2=8=1) polynomials. 


THEOREM 4. If and as n—~, then max |I(x)|—1 
+|t| as ne (—1<tS1). This is also an upper bound if | x;| <|t| 
at least for large values of n. 


Proor. It was shown in I that and for 


max | 1” (x) | <1.87. Since (1) max | 1x) | is attained either between 
Xe41 and x,_; or at x= +1, the theorem is valid for /=1 and, by sym- 
metry, for = —1. 

If | 2| <i, the preceding paragraph and Theorem 2 complete the 
proof. In fact, max ta (x)| =i+ | x.| at least for large n. 

The next two theorems are obtained in a similar manner. 


THEOREM 5. If a=}, B=% and x,—1t as n— ~~, then max | 1” (x) | 
4 if t=—1, 1if —1<t< —}, if 


THEOREM 6. If a=, B=} and x,—t as n—@, then max | 2” (x) | 
if 1 if St<1, if t=1. 


5 Erdés and Griinwald, Note on an elementary problem of interpolation, this Bulle- 
tin, vol. 44 (1938), pp. 515-518. 
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The max | (x) | is attained at x=+1 since* (I) O:4:—- 
provided } <a, BS} and x,=cos Using the 
second asymptotic formula and the fact‘ that 0,—j, as n— © where 
jx is the kth positive zero of Jg_1(x), we find that 


| (1) | (47%) | T(B)J | as n— ©, k constant, 


1 (—1)-30 which proves the theorem: 

THEOREM 7. Max | (x)| >(4j:)*-?| as (where 
Sa, BSH, is first positive zero of 

A similar result holds for (x) if B is replaced by a. 

For Legendre polynomials (a=$=1) this limit is approximately 
1.602. For a=B =} and a=6 = # the limit of Theorem 7 is also an up- 


per bound for max | 1 (x) | and max |/{”(x)|. Whether this is true, in 
general, remains unanswered. 


PuRDUE UNIVERSITY 


AN INVARIANCE THEOREM FOR SUBSETS OF 5S"! 
SAMUEL EILENBERG 
The purpose of this paper is to establish the following. 


INVARIANCE THEOREM. Let A and B be two homeomorphic subsets of 
the n-sphere S*. If the number of components of S*—A 1s finite, then tt 
ts equal to the number of components of S*—B. 


In the case when A and B are closed this theorem is a very well 
known consequence of Alexander’s duality theorem and its generaliza- 
tions. In our case we also derive our result as a consequence of a 
duality theorem. However, the duality is established only for the di- 
mension n—1. 

Given a metric space X we shall say that I“ is a k-cycle in X if 
there is a compact subset A of X such that I is a k-dimensional con- 
vergent (Vietoris) cycle in A with coefficients modulo 2. We shall 
write '*~0 if !*~0 holds in some compact subset of X. The homol- 
ogy group of X obtained this way will be denoted by X*(X) ; the cor- 
responding connectivity number, by p*(X). The number p*(X) can 
be either finite or ©. 


1 Presented to the Society, December 28, 1939. 
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DUALITY THEOREM. Let A CS" and let 20, 21, - - + , 2m belong to m+1 
different quasi-components? of S*—A. There are m linearly independent 
(modulo 2) (n—1)-cycles 


(1) 
of A such that 
n—l 0 


where y§ is the 0-cycle 2o+2; (consisting of the two points 2, 2; each of 
them with coefficient 1) and v(T, y) is the linking number. 

In case S*—A has only m+1 quasi-components, the cycles (1) form a 
basis for 


ProorF. In case A is closed the theorem turns out to be a particular 
case of the generalized Alexander duality theorem.* We shall prove 
our theorem for arbitrary sets A using the theorem for closed sets. 

Since 2, 21, - , 2m belong to m+1 different quasi-components of 
S"—A there is a decomposition S*—A =A o+Ai+ ---+Am such 
that A; and A;A;+4;A;=0 for 1, j=0, 1,---,m. Let Bo, 
B,, -- -, Bx be open disjoint sets such that A;CB;fori=0,1,---,m 
and let B=S*—(Bo+Bi+ --- +B,,.). Clearly B is a closed subset 
of A and 2p, 21,---, 2m belong to m+1 different quasi-components 
(equals components) of S*—B. 

Applying the duality theorem to the closed set B we obtain the 
cycles (1) satisfying (2). In order to prove that they determine lin- 
early independent elements modulo 2 of %*—1(A) consider a cycle 
I"-'=a,Tt-'+ --- +a,1%"' where a;=0, 1. It follows from (2) that 


y}) =a;. Therefore in A implies - - - =an=0. 
Suppose now that S*—A consists of exactly m+1 quasi-compo- 
nents. It follows that the sets Ao, Ai, -- - , Am are connected. 


Let ['*-' be an (m—1)-cycle of A contained in some closed set 
DCA. Let E; be the component of S*—(B+D) containing A; 
(«=0, 1,---, m) and let --- +£,,). It follows 
that (1°) E is a closed subset of A, (2°) S*—E consists of exactly 
m-+1 quasi-components (equals components), (3°) the points 


2 Two points x1, x2€ X belong to the same quasi-component of X if there is no de- 
composition X = A;+Az such that and A:A2+A1A2=0. If the number 
of quasi-components of X is finite then every quasi-component is a component. 

3 Alexander, J. W., Transactions of this Society, vol. 23 (1922), pp. 333-349; 
Frankl, F., Sitzungsberichte der Wiener Akademie der Wissenschaften, Mathe- 
matisch-Naturwissenschaftliche Klasse, 2A, vol. 136 (1927), pp. 689-699; Alexan- 
droff, P., Annals of Mathematics, (2), vol. 30 (1928), p. 163. 
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Zo, 21,°°*, 2m belong to different quasi-components of S*—E, 
(4°) the cycles (1) and ['*~! are contained in E. According to the 
duality theorem for closed sets the cycles (1) form a basis for 3¢"—!(E). 
This implies the existence of a1, dz, -- + , dm (a;=0, 1) such that 


This proves the theorem since ECA. 
Given a metric space X let the number bo(X) be defined as follows: 


b(X) =0 if X =0, 
bo(X) =m if X ¥ Oand X has exactly m+1 components, 
bo(X) = @ if X has an infinity of components. 


Clearly the value of bo9(X) remains unchanged if we replace in its 
definition components by quasi-components. The duality theorem im- 
plies therefore the following: 

(I) For every subset A of S" we have 


= bo(S* — A). 
(Il) For every two homeomorphic subsets A and B of S” we have 
bo(S* A) bo(S* B). 


The invariance theorem stated in the introduction follows directly 
from (II). 

If X consists of an infinity of components, then instead of taking 
bo(X) = © we could define bo9(X) to be the cardinal number corre- 
sponding to the class of all components of X. Similarly p*(X) could 
be redefined as a cardinal number. But with these new definitions (I) 
and (II) are no longer true.‘ In fact, let A be a subset of S' such that 
S!—A is closed and enumerably infinite, and let B be a subset of S! 
such that S'—B is perfect and non-dense. It is clear that A and B 
are homeomorphic, that bo(S'—A)=p°(A)=p%B)=No, and that 
— B) = 2%, 


UNIVERSITY OF MICHIGAN 


* That (II) is no longer true was first pointed out to me by Dr. L. Zippin. 


MEASURABILITY AND MODULARITY IN THE 
THEORY OF LATTICES! 


M. F. SMILEY 


In a previous note? a notion of measurability (with respect to a 
function u) of elements of an arbitrary lattice was introduced. Our 
purpose there was to study closure properties of the subset of measur- 
able elements. To do this it was convenient to assume that the lattice 
was modular. Results of V. Glivenko,? and of L. R. Wilcox and the 
author‘ indicate that the idea of measurability and that of modularity 
are intimately related. The purpose of this note is to exhibit a further 
relationship which does not depend on metric properties of the func- 
tion p. 

In a lattice L we call the ordered pair (a, b) of elements of L modu- 
lar and write (a, b)M in case (a;+a)b=a,+<ab for every a; <b. This 
relation has been studied by L. R. Wilcox.' If (a) is a real valued 
function defined on L we say that an element cCL is u-measurable in 
case 


u(c) + = w(c + b) + u(cb) 


for every bCL. The symbol L(y) will denote the totality of elements 
of L which are w-measurable. We call wu proper in case a<b with 
p(a) implies a=b. 

Before we discuss the general case, let us consider a lattice Lo of 
finite dimension (that is, satisfying both chain conditions) in which 
every principal chain joining two elements a, b€ Ly has the same 
length n(a, b). Examples of such lattices are well known.® Let 0 de- 


1 Presented to the Society, February 24, 1940. 

2 A note on measure functions in a lattice, this Bulletin, vol. 46 (1940), pp. 239-241. 
This concept specializes to that of permutability with all subgroups (that is, the 
quasi-normality of O. Ore, Structures and group theory 1, Duke Mathematical Journal, 
vol. 3 (1937), p. 162) if we set u(s)=log o(s) in the lattice of subgroups of a finite 
group, and to that of measurability in the sense of Carathéodory with respect to an 
outer measure function. 

3 Contributions al étude des systémes de choses normées, American Journal of Mathe- 
matics, vol. 59 (1937), pp. 933-934. 

4 Metric lattices, Annals of Mathematics, (2), vol. 40 (1939), p. 313. 

5 Modularity in the theory of lattices, Annals of Mathematics, (2), vol. 40 (1939), 
p. 491 ff. 

6 The “exchange” lattices discussed by Saunders Mac Lane (A lattice formulation 
for transcendence degrees and p-bases, Duke Mathematical Journl, vol. 4 (1938), pp. 
455-468) as well as the “semi-modular” lattices of finite dimension of Wilcox (op 
cit., pp. 502-505) have these properties. We make no use, however, of the metric 
properties of these systems. 
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note the least element of Ly and define uo(a) =n(0, a). We note that po 
is proper. These considerations lead to the following theorem. 


THEOREM 1. If cE Lo then for every Sc if and only if 
ac implies that (b, a) M for every bE Ly and furthermore that x =ax+5b 
for every for which bSxSa+b. 


Proor. To show the sufficiency of the condition consider elements 
b, for which Sc. It is easily seen that Lo(uo) provided that 
n(c\b, =n(b, +b). We shall prove in fact that 
As usual, for that is, define 
Clearly f(x1)€Ga+b/b. Also =c.+6/b; for if by 
hypothesis f(ax)=x. Suppose f(x1)=f(xe2), that is, 
Clearly By hypothesis c,:)M, and it follows 
that x1=x2. Thus establishes a one-to-one cor- 
respondence between ¢:/c:b and c:+b/b. This correspondence pre- 
serves the ordering relation and hence ¢,/ab™c,+b/b. The proof of 
the sufficiency of the condition is complete. The proof of the necessity 
is included in the proof of the following theorem. 


THEOREM 2. Suppose that u(a) is a proper real valued function de- 
fined on a lattice L. Consider an element cEL. If «.€ L(y) for every 
cic, then (b, a)M for every aSc and every bEL, and furthermore 
x=ax+b for every xEL such that bSxSa+b. 


Proor. Consider elements a, b€ L with a Sc. Toshow that (b, a) M, 
it suffices, since u is proper, to prove that 
+ b)a) = + ab) 
for every a; Sa. By hypothesis L(u) and consequently 


+ b)a) = + 6) + w(a) — w(a + 5) 

+ 6) + w(ab) — 

u(ai) + w(b) — w(aib) + w(ab) — 
u(ai) + (ab) — w(aib) = + abd). 


Thus we have (b, a)M. Now consider an element xCL for which 
bsxSa+tb. Clearly ax and hence a, axC L(y). It follows that 


+ 6) = w(ax) + — u(ad) 

u(x) + — w(a + x) + — p(ad) 
= p(x) + w(a) + w(b) — w(a + 6) — (ab) 
h(x). 
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Since » is proper we have x=ax+5 and this completes the proof. 


Coro.iary. If pEL is a point, then pEL(p) implies that b+ p is 
prime over b for every bEL such that bp=0. 


Proor. Consider a point pC L(u) and an element b€L for which 
bp=0. We always have (b, p) M and thus the first part of our conclu- 
sion is vacuous.’ The second part yields x=b+-px for every xEL for 
which b<x<b+>. Since px=0 or px=p, it follows immediately that 
b+? is prime over b. 

REMARK. Consider the example of a lattice of six elements 0, p, q, d, 
c, 1, with p<c, g<b, 0 the least element, and 1 the greatest element. 
Our corollary yields the fact that no proper real valued function yu can 
be defined over this lattice for which p or q is y-measurable. 


LEHIGH UNIVERSITY 


7L. R. Wilcox, op. cit., p. 491. 


A STIELTJES INTEGRAL EQUATION! 
F. G. DRESSEL 


Introduction. By making use of the notion of derivative taken with 
respect to a function of bounded variation, solutions of the Young- 
Stieltjes integral equations of the following type 


(1) F(x) = M(2) +2 f H(x, y)4F(y) 


are obtained in this paper. Since the integration by parts formula is 
not valid for Young-Stieltjes integrals, equation (1) cannot be im- 
mediately changed to the equation 


(2) f(z) = m(2) +2 f(y)dK(», 9). 


Fischer? in his treatment of equation (2) imposed three conditions 
on K(x, y) to obtain solutions. In the second part of this paper, we 
show that one of these conditions is sufficient to insure the existence 
of a bounded monotone increasing function g(y) such that 


(3) | K(x, ys) — K(x, | S| g(y2) — g() |. 


It has been shown? that if condition (3) holds, then equation (2) can 
be changed into an ordinary Fredholm equation. Thus it appears that 
only one of the three conditions imposed on K(x, y) by Fischer is 
needed to insure the solution of (2). 

Before handling equation (1), let us note that if g(y:) <g(y2) for 
41 <2, We may apply the transformation‘ 


(4) A(s) = lim sup E,[s = g(y)], (0) S s S g(1), 


and send the integral equation 
1 
(5) = w(2) +2 f G(x, 
0 


into the Fredholm equation 


1 Presented to the Society, February 24, 1940. 

2C. A. Fischer, The Fredholm theory of the Stieltjes equation, Annals of Mathe- 
matics, (2), vol. 25 (1923-1924), pp. 124-158. 

All functions used in this paper are assumed to be measurable Borel. 

3 F.G. Dressel, A note on Fredholm-Stieltjes integral equations, this Bulletin, vol. 44 
(1938), pp. 434-437. 

4 FE, [s>g(y)] designates the set of values of y such that s2g(y). 
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(6) (8(i)) = +» f G(B(0), B(s))0(6(s))ds, 


where a=g(0), 5=g(1), and ¢ is any of the possible solutions of 
«= Bi), g(1). 
Now in L? the Fredholm theory applies to (6) if 


1 1 6 5 
7) f f | G(x, y) [*de(2)dg(y) = f f | G(B(2), B(s)) |2dtds 


exists.5 Also if h(t)C L?, we have by Schwartz's inequality 
| 1 2 1 1 
0 0 0 


and from this we see [}G(x, B(t))h(t)dt is integrable with respect to 
g(x) if (7) holds. The above statements lead to the following lemma: 


LemMa~ 1. If g(x1) <g(x2) for x1<x_ with g(x) bounded, and if under 
the transformation (4) the functions w(x), G(x, y)C L*, and also if (7) 
holds, then for all values of X, except perhaps the characteristic values* 
of the integral equation (6), the equation (5) has a solution integrable 
with respect to g(x). 


Stieltjes integral equations. Consider now the equation (1) where 
the given functions M(x) and H(x, y) are assumed to be absolutely 
continuous® g(x). Here g(x) is a bounded increasing function, continu- 
ous from the right except perhaps at x=0. Let m(x) and h(x, y) be 
the derived numbers of M(x) and H(x, y) taken with respect to g(x), 
and form the equation 


(8) = w(s) +2 f 


We see from our lemma that equation (8) will have a solution under 


5 For list of references and terminology see E. Hille and J. D. Tamarkin, On the 
characteristic values of linear integral equations, Proceedings of the National Academy 
of Sciences, vol. 14 (1928), pp. 911-914. 

* For terminology see A. J. Maria, Generalized derivatives, Annals of Mathematics, 
(2), vol. 28 (1926-1927), pp. 419-432. Let f(x) be a function of bounded variation and 
denote by f(e) the completely additive function of sets defined by f(x). (The facts here 
are that for any interval a(x; St<-x2) the relation f(a) =f(x2+0) —f(x: —0) holds.) The 
function f(x) is said to be absolutely continuous with respect to the non-decreasing 
function g(x) if whenever E is a set for which f(E) is defined and g(E)=0, then 
=0. 
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rather weak restrictions on m(x) and h(x, y). Let us now assume (8) 
has a solution, say a(x), which is integrable g(x), and define the func- 
tion Q(y) as 


Q(y) = f 
0 


= 0, y= 0. 


We are now able to write the solution of (1) as 
1 
9) F(x) = M(x) H(z, 400). 
0 


To see that the above F(x) furnishes a solution of (1), substitute 
the right side of (9) in both sides of equation (1). After cancelling 
M(x) we have 


f H(x, y)dQ(y) J H(x, y)dM(y) 


(10) 1 1 
| A(x, | Aly, dO(2). 
+ i, (x, y) (y, t)dQ(t) 


Since a(x) is a solution of (8), on integration of that equation with re- 
spect to g(x), there results 


Q(x) = M(x +0) — M(0) +2 f [H(x + 0, t)— H(O, x £0, 


= M(x+0) — M(0) + rf [H(x + 0,4) — H(0,#)|dQ(), x0, 


= 0, x=0. 


If the above is substituted in the left side of (10), it is easily seen to 
be equal to the right side, hence (9) satisfies (1). 
From the above and our lemma we have the following theorem: 


THEOREM 1. Let g(x) be continuous from the right in 0<x <1, and 
satisfy the conditions of Lemma 1; let M(x) and H(x, y) in equation (1) 
be absolutely continuous g(x) and have m(x) and h(x, y) as derived num- 
bers (g(x)). Then if m(x), h(x, y) satisfy the conditions placed on w(x), 
G(x, y) respectively in Lemma 1, equation (9) furnishes a solution of (1) 
for all values of except those that are characteristic values of the integral 
equation (8). 


7F.G. Dressel, loc. cit., p. 435. 
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Coro.Luary. If 
|H(x2, y) — H(x1, y)| — g(x) |, 
| M(x2) — M(x)| S| g(x2) — g(x) | 


where g(x) meets the requirements of Theorem 1, then (9) is the solution 
of (1) for all values of d that are not characteristic values of the integral 
equation (8). 


The Fischer theory. In the treatment of equation (2), Fischer im- 
posed the following condition on the kernel K(x, y) =K(x, y+0): 


V2K(x, y) < N. 
Here N is a constant and V2K has the following meaning: 
V2K(x, y) Lu.b. | K(xi, yi) 7 K(xi, ¥i-1) |; 
i=l 


it being understood that O=yo<yi< --- <y,=1, and {x;} are any 
x’s in 0<x<1. We shall soon show that the above condition implies 
that inequality (3) holds. 

Define the set E on the interval 0< y <1 to be given by 


y = m/2*, m=0,1,2,---,m32*%, n=1,2,---, 
and let g(x, y) be a function with the properties 
(a) g(x,y) = g(y, x) 20, «g(x, y) S g(x, 2) + g(z, y) 


and if OS yo<yi< --- <ynS1 then 


(b) Dd elvis vin) < for all m. 


i=1 


Let y;CE and define A,(y,) as 


Due to the conditions on g(x, y) we have 
A,(yj) S < 


hence lim, A,(y,) exists: let us call it A(y,). Again let 1, y2CE with 
then if y,=7r/2*, ye=m/2*, 
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Ax(y2) — = > 


i=r+1 2 


1 
) = g(ye, 91); 
hence 


Ax(y2) — Ax(yi) 2 2 O. 


Thus A(y) as defined on E is a bounded monotone increasing function; 
hence we may extend the definition of A(y) on CE so as to define a 
monotone increasing function of bounded variation® on 0S y<1. We 
make A(y) unique by taking A(y+0) =A(y) if yCCE. 

Turn now to Fischer’s condition V2K(x, y) and define 


g(y2, ¥1) = Lu.b. | K(x, y2) — K(x, 1). 
0szS1 


This g(x, y) is easily seen to satisfy conditions (a) and (b); hence the 
following theorem is valid. 


THEOREM 2. If for the function K(x, y)=K(x, y+0) the condition 
V2K (x, y)<N holds, then there exists a bounded monotone increasing 
function A(y) such that 


| K(x, — K(x, S| A(y2) — A(y) |. 


COROLLARY. The condition V2K(x, y)<N imposed on K(x, y) 
=K(x, y+0) by Fischer is sufficient to change equation (2) into an 
ordinary Fredholm equation.® 


The existence of the following limit is essential in Fischer’s treat- 
ment of equation (2): 


lim >> [K(xi, — K(xi, xi-1)], <2, =1, 
6-0 j=l 
where §6=max (x;—x;-1). The Fischer theory will therefore not apply 
to the kernel 
K,(x, y) = 1, x = 5, 
= Q, elsewhere. 


However, the integral equation (2) with the above kernel is easily 
treated since V2Ki(x, y) <2. 


BROWN UNIVERSITY AND 
DvuKE UN‘VERSITY 


8 G. C. Evans, Logarithmic Potential, American Mathematical Society Colloquium 
Publications, vol. 6, 1927, pp. 5-6. 
®F, G. Dressel, loc. cit., p. 436. 
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FOURIER COEFFICIENTS OF BOUNDED FUNCTIONS! 
BERNARD FRIEDMAN 


The results of this paper are divided into two parts. First: inequali- 
ties for the Fourier coefficients of any bounded function; second: an 
approximation theorem for the Fourier development of an arbitrary 
bounded function. 

Inequalities for Fourier coefficients have been discussed in a paper 
by Professor Szdsz.? However, his work deals mainly with linear in- 
equalities for complex coefficients. The inequalities to be investigated 
in this paper are not linear. Nevertheless, they are the best possible, 
for this reason: given any set of numbers which makes the inequality 
an equality, there exists a bounded function which has these numbers 
as its Fourier coefficients. 

A simple illustration will clarify this. Let f(x) be a bounded meas- 
urable function in (—7z, 7) such that | f(x) | <1. The Fourier coeffi- 
cients of f(x) are given by the formulae 


‘= -f f(x) cos nx dx, b, = —f f(x) sin nx dx, 
us 


1 
—f | cos nx | dx, 
1 
—f | sin nx | dx 
since | f(x)| <1. 


Since the cosine is negative in the intervals (—7, —2/2) and 
(x/2, x) and positive in the remaining interval (—7/2, 1/2), 


Then, it is clear that 


1 
s— f | f(x) || cos mx| dx 


TAN 


IIA 


| <— f "| sin nx| dx 


f | cos «| dx = f (— cos x)dx + cos x dx 


—1/2 


+f (— cos x)dx = 4. 


Therefore, | <4/r. However, if fi(x) =—1, —t<x<—7/2; fi(x) 


1 Presented to the Society, September 8, 1939. 
2 American Journal of Mathematics, vol. 61 (1939). 
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=1, —4/2<x<m/2; fi(x) =—1, r/2<x<7; then 


f fi(x) cos xdx = f (— cos x)dx + cos x dx 


—1/2 
+f (— cos x)dx 


and the bound 4/z will actually be attained for the function f,(x). 
Likewise, cos nx|dx and J”, | sin nx|dx both equal four, so that 
|a,| and |b,| $4/2; these bounds will be attained for the functions 
signum cos mx and signum sin mx respectively. 
But these bounds for the Fourier coefficients cannot be attained 
simultaneously. If one of the coefficients is too large, the others must 
be small. For example: b; and b2 must satisfy the following inequality: 


(= bn) + 

4 1 4 

so that if =4/m then while if b.=4/m then b,=0. Conversely, 
any numbers j, 2 satisfying this relation will be the Fourier coeff- 
cients of some bounded function. 


Similar relations hold between the other Fourier coefficients, but 
they are much more complicated: 


= 

lA 
| 


3 
< < (=s.- t) - (= 1) + 1. 
4 4 4 


After the inequalities for the Fourier coefficients are obtained, they 
are used to derive an approximation theorem. But instead of being 
an approximation to the function itself, it will be an approximation 
to the Fourier development of the function. More precisely: take a 
bounded measurable function f(x) whose absolute value is less than 
one; let its Fourier development be ao/2+)_(a, cos nx+b, sin nx). 
Then, for any N there exists a function gy(x) which takes the values 
1 and —1 only, such that the first N Fourier coefficients of gy(x) are 
the same as the first NV Fourier coefficients of f(x). 

In general, there will be N+1 discontinuities. If f(x) is even or 
odd, gv(x) will be even or odd. For example, let f(x) =(4—<x)/z, 
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0<x <2; f(—x) = —f(x). Then its Fourier series will be 


2 sin nx 
nN 
2a 
g(x) = +1, g(x)=+1, 
x)=-1, —<2 : 
3 2 3 


Incidentally, one of the inequalities leads to the following result: 
If F(x) isa bounded measurable function in absolute value less than 3, 


fore cos xdx co( 


The method used in proving the above inequalities is as follows: 
Let f(x) be a bounded measurable function in (—7, 7) such that 
| f(x)| <1. Let the Fourier development of f(x) be 

ao 
ry + > (a, cos nx + b, sin nx). 


Consider the expression 
(1) a) = Modo + + + + Mon—10,, + Henan 


where po, #1, ° Men are arbitrary real parameters. The functions 
fn(u, @) can be expressed in terms of the function f(x) by using the 
usual formulae for Fourier coefficients 


1 1 
6, = —f f(x) cos nx dx, b, = —f f(x) sin nx dx. 
us — 
From these formulae and (1), it is clear that 
1 
fn (u, f(x) [wo + ws sin x + cosx+--- 
us 


+ Sin 2x + pm cos nx|dx 


1 


say. 


— 
— 
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Assuming the y’s are fixed, the maximum of f,,(u, a) will be attained 
when the integrand f(x)C,(u, x) is as large as possible. But 
| f(x)C.(u, x) | = | C,.(u, x) | 


since | f(x) | <1. However, if f(x) is taken as signum C,(u, x); that is, 
f(x) =1 when C,(u, x) >0 and f(x) = —1 when C,(yu, x) <0, then 


| f(x)C,(u, x) | | x) | 


The following result is now obvious: 
(2) 0) 2)| dv = 


and the equality holds only when 
f(x) = signum C,(u, x) = gn(u, x). 


For example, if m=1 and yw, =0, formula (2) states that 


1 
Ho@o + S —f fi(x)Ci(u, x)dx 
where 
fi(x) = signum (uo + ye Cos x). 


Assume po+p2 cos x=0 has the real roots +a; then gi(u, x) will 
be defined as follows: 


gi(u, *)= +1, a, 
gi(u, x) = if. if Bo > 0; 
gi(u,x)=—1, 
gifu, x) = +1, { if po < 0; 


and 


1 
f | Ci(u, x)| dx = + —f gi(u, x)Ci(u, x)dx 


1 
= + f Ci(u, x)dx +f Ci(u, x)dx 


+f - Ci(u, | 
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= + — [po(4a — 2) + wet sina] = + =) + pe sin a. 


If uo is positive, the upper sign holds; if uo is negative, the lower sign 
is used. 
But C,(u, =0; that is, wo+pe cos ay =0 or 


cos a = — po/p2, 
a@ = arc cos (— po/u2), a — = arc sin po/pe, 
: 2 1/2 
sin = (1 — wo/us) 
so that 
4 e 2 1/2 
D,(u) = + — [wo arc sin + — 10/2) |. 
Therefore 
4 ° 2 2.1/2 
(3) Hodo + weds S + — [wo arc sin (uo/u2) + — ] 


where dp» and a, are the Fourier coefficients of any bounded function. 
It is easy to see that the equality sign holds only when do and a; equal 
the Fourier coefficients of g:(u, x). 

Similar results hold’ for any m. Asume C,(u, x) =0 has 2” real 


roots a; where --- Sae,S7. Call and 
= 

Assume C,(u, x) is positive between a and a,. Then g,(u, x) will be 
defined as follows: gn(u, x) =(—1)', 2n+1. 


If C,(u, x) is negative, then g,(u, x) =(—1) a; <x Therefore, 
1 2ntl pai 

so that from (2) 

(4) Modo + + pedi + + + S D,(u). 

Let 
(5) a; = b; = %ai-1. 
Consider the 2n+1 dimensional space of points whose coordinates are 


the x; (j=0, 1, 2,---, 2n+1). The inequality (4) can be written as 


2n+1 


D Hix; D,(u). 


j=0 
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This implies that, for any bounded measurable function, the point P 
whose coordinates are given by (5) must lie on one side of the hyper- 
plane whose equation is 


2nt+1 


(6) D = 


The point P will lie on the plane if (4) becomes an equality. This 
will occur when and only when f(x) has its first 2n+1 Fourier coeffi- 
cients equal to those of g,(u, x). 

Since the uw; were arbitrary, the same result holds for any set 
Mo, Mi, » Therefore, the point P, whose coordinates are (xj), 
j=0,1,---,2n+41, must lie on one side of every plane in the family 
of planes defined by the equation (6). 

The previous example will clarify this. Equation (3) implies that 
if, for any f(x), ao is plotted as the x coordinate and a; as the y co- 
ordinate of a point, then such points will lie on the origin side of the 
straight line whose equation is 


Hox + poy = D,(u). 


The point will lie on the straight line if and only if f(x) has its Fourier 
coefficients a) and a; equal to those of gi(u, x). 

Since the y's were arbitrary, the same result holds for any set po, pe. 
Therefore, the point P, whose coordinates are (ao, a1) must lie on one 
side of every line in the family of straight lines defined by the equation 


Hox + = D,(u). 


This implies that the point P will lie inside the region enveloped by 
these straight lines. 

The usual procedure in finding the equation of this envelope is, 
first, to differentiate partially with respect to the parameter, and sec- 
ond, to eliminate the parameters between the resulting equation and 
the original equation. The partial differentiation results in the follow- 
ing two equations: 

oD oD 2 21/2 
— = + (4/n) arcsin (uo/u2), y = — = + (4/4)(1 — 
Ope 

After the elimination of the parameters, the following equation is 

obtained for the envelope: 


y = (4/x) cos (rx/4). 


This implies that 


— 
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a; S (4/r) cos 


Equality will hold, as before, if f(x) has the same first two Fourier 
coefficients as any function gi(u, x). 

In the general case, the equation of the envelope is obtained by the 
same method. When formula (6) is differentiated partially with re- 
spect to the parameters, the following equations result: 


aD,(u) 
On; 


j=0,1,-++, 


But 


9D, (x) 1 @ [ 


Ou; Op; 


i=0 é 
where the a;, the roots of C,(u, x) =0, also depend on the y;. Now 


On; a; ay On; 


0a; 
+ (— 1)*C,(u, = a) — 
Ou; 


7 


2 


by the standard rule for differentiating under the integral sign. 
However C,(u, ai41) = Ca(u, ai) =0 and 


x) 
——— = sin (j + 1)x/2 if 7 is odd, 
On; 
= cos (jx/2) if 7 is even, 

so that 

1 2n+1 

f (— 1)‘ sin (j + 1)x/2dx if 7 is odd, 
a T i=0 a; 
(7) 1 2n+1 @i+l 
=+— > (— 1)‘ cos (jx/2)dx if 7 is even, 
T i=0 a; 


where the signs are either all positive or all negative. 

Instead of eliminating the parameters y, it is simpler to eliminate 
the parameters a, thus obtaining the equation of the envelope 
@,(x;) =0 which implies the inequality 


(8) ¢nx(a, 6) 0. 


Since ¢,(x;) is obtained as the envelope of planes, it is obvious that 
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the region bounded by it is the convex hull of a family of half-spaces. 
Again, the inequality will hold if and only if f(x) has its first 2n+1 
Fourier coefficients (ao, a1, bi, - - - , dn, bn) equal to the corresponding 
Fourier coefficients of any one of the functions g,(u, x). 

A few illustrations will clarify this method. Let n=2 and po=p2 
=p4=0. Then the a; will be the roots of the equation 


fi sin x + yz sin 2x = 0, 


and from formula (7) 
1 @it+i 
= +—>f (— 1)‘ sin «dx = + (— cos a3), 
a; 
since a, = —a3, and 
1 @i+l 
m=+t+— (— sin 2x dx = + (2/x)(1 — cos 2a3). 


Theretore or according as 
the plus or minus signs are used. But since x; was the coordinate for 
b; and x; for be, the following inequality results: 


(xb,/4)? + bs| /4 1. 


From the result proved above: that the Fourier coefficients of f(x) 
satisfy certain inequalities or, stating it geometrically, lie in certain 
convex regions, the approximation theorem follows. The theorem can 
be stated as follows: 

Let f(x) be a real, bounded, measurable function in absolute value 
fess than one. Let its Fourier development be 


ado 


+ (a, cos nx + 6, sin nx). 


Then for every N, there exists a step-function gy(x) which takes on 
the values of +1 and —1 only, such that the first N Fourier coeff- 
cients of gy(x) are the same as the first N Fourier coefficients of f(x), 
that is, 


1 1 

a, = —f gv(x) cos mx dx, b, = —{ gv(x) sin nx dx, 

1, 

Again, let a; =x3,, b; =x3,_,. Then, as before, the point P’ whose co- 

ordinates are , X2y) must satisfy the inequality (8): 
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on(x}) <0, or, geometrically, the point P’ must lie inside the convex 
region bounded by the hypersurface whose equation is 


on(Xo, M1, 5 xen) = 0. 


Suppose now that P’ lies on the hypersurface, that is, its coordi- 
nates satisfy the equation 


, 
On(%0, X1, X2,° °°, Xev) = O. 


Since ¢y(x;) =0 was obtained as the envelope of the family of hyper- 
planes represented by equation (6), )>u;=Dwy(u), there will be a 
plane of the family tangent to the surface at P’. But since P’ lies 
on this plane, whose equation can be written 


+ + mote +--+ + = Dy(p), 


its coordinates must equal those of the function gy(u’, x). Therefore, 
gn(ud, ~~~ , Man, x) is the function described in the theorem. 

If P’ lies inside the hypersurface, consider the points in 2N-+3 di- 
mensional space whose coordinates are x1, Xan» Xen 41, 
Xenv+3) where x2yv41 is an arbitrary variable. They will lie on the straight 
line through x1, x7, X2y, 0,0) perpendicular to the hyper- 
plane Xen+1 = Xen+2 = 0. 

Since @y4:(x;) =0 is a closed convex surface for all n, the above 
straight line will intersect it in some point whose coordinates 
are (x¢, x1, Xey, X3v42)- Just as before, there will 
exist a step-function gy4:(x) whose Fourier development is ag /2 
+)°(a,/ cos nx+b,/ sin nx) and such that a/ =x3,, bf =x,_,;71=0, 1, 
2,---,N+1. 

This function satisfies the conditions of the theorem for 


= = Gj, Xoi-1 = 5; = 


UNIVERSITY OF WISCONSIN 


THE GENERALIZATION OF A LEMMA OF M. S. KAKEYA 
J. GERONIMUS 
We shall prove the following: 
Lemma. It is always possible to find the unique polynomial 


28 
$*(2) = 
k=0 


of degree 2s possessing the following properties: 
o*(z) = ci*(z)r(z)r*(z), = const., 
the polynomial i(z) of degree o Ss having all roots in the domain | z| po 
i(z) = [[ @- a), | a;| > 1, 
i=1 
and the polynomial r(z) being of degree v=s—ca: 
1 
r(z) =|] — a), T*(s) = 27 (-) = [J (1 — 2a). 
i=1 i=1 
II. It ts subject to the conditions 
28 
w;(¢*) = vice = d;, 0, 
k=0 


the given linear functionals w; being such that every polynomial $(z) of 
degree n=2s for which 


2s n 
(i) : k 
wi(d) = = 0, (i 0, i 5), = YK, 
k=0 k=0 
has s+1 roots at least in the domain |z| sol 
In the particular case when 


wild) = o(z:), | <4, 


this lemma has been proved by M. S. Kakeya [1];! without being 
aware of his result we have proved this lemma in the case? 


1 Numbers in brackets refer to the bibliography at the end. 
2 In [1] and [2] one may find the application of this lemma to some extremal prob- 
lems. 
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(¢) =(=*) i= 0,1 


In order to prove this lemma in the most general case we consider the 


following extremal problem: 


PROBLEM. To find the minimum of the integral 
2r 
(1) = f | elt, 
0 


t(z) being the given polynomial of degree s with t(0) 0 and b(6) being 
a trigonometric polynomial of order n=2s: 


5(6) = (-)} R> z= 
k=0 
subject to the conditions* 
2s 
(i) 
(2) wi(b) = wi(¢) = = di, 
k=0 
The fundamental property of our functionals w; yields at once that 
every trigonometric polynomial 5(@) subject to the conditions 
wi(b) = 0, i=0,1,---,5, 
has in (0, 27) no more than 2(n—s—1) changes of sign. It is clear 


that there exists a solution of our problem. Further, the necessary 
conditions for an extremum are 


sgn | ¢(z) |? = R > Anz*, s =e, 
k=n—2s8 


whence we find at once that the Fourier expansion of sgn b*(@) is of 
the form 


sgn 6*(6) = R B,z*, =e", 
k=n-8 


We have shown in [2] that every trigonometric polynomial with this 
property must be of the form 


b*(0) = R{ } (z)7 (1/2), s= 


q(z) being a polynomial of degree o Ss all of whose roots lie in the 
domain | z| <i, and r(z) being a polynomial of degree v=s—ca. 


3 The functionals w; are the same as above. 
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The polynomial 6*(6) for which the minimum is attained is unique. 
If there were two such polynomials, b*(@) and 6#(@), then we would 
have 


b* + 
Lis) < < L(b#); 


then b*(@) and 5(@) would change sign at the same points, that is, 
the polynomial 
bi#(0) — b#(0) = |? — ro(z) |2}, = 
would have at least 2(m—v) changes of sign in (0, 27); but since 
wi(b* — = 0, 4=0,1,---,5s, 


the polynomial b*(@)—b#(@) cannot have more than 2(n—s—1) 
changes of sign in (0, 27); this contradiction proves the unicity of the 
polynomial solving our problem. Thus we find that there exists the 
unique polynomial b*(@) minimizing (1) under conditions (2) and it 
must be of the form 
b*(0) = R{ } 

Since the real parts of two polynomials coincide on the unit circle, 
these polynomials are identical, that is, 


whence we find finally 
*(2) = yd + + = 
where 
i(z) = q*(2) = 2°g(1/2). 


Thus we have found the polynomial ¢*(z) satisfying all the conditions 
of our lemma. 
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NOTE ON CERTAIN ORTHOGONAL POLYNOMIALS! 
DUNHAM JACKSON 


1. Introduction. A well known theorem? states that if K,(x, 2) 
=)"*px(x) p(t) is the kernel associated with a system of orthonormal 
polynomials on an interval (a, b) with weight function p(¢), and if xo 
is a real number not belonging to the open interval (a, 6), the function 
K,(x, t) is orthogonal to every polynomial of lower degree with re- 
spect to |t—xo| p(t) as weight function. This result is to be extended 
below to orthogonal trigonometric sums, and more generally to other 
orthogonal polynomials in two real variables on an algebraic curve.* 
The fact that a single polynomial of the mth degree in the original 
formulation is replaced by two or more sums or polynomials of like 
degree in the generalized orthogonal systems imparts to the extension 
some features of novelty. Certain other generalizations are briefly 
mentioned also. 


2. Trigonometric sums. Let Up(x), Ui(x), Vi(x), - -- be orthonor- 
mal trigonometric sums for weight p(x), and let 


K,, (x, = Uo(x)Uo(s) + Ui(x)Ui(s) + Un(x)U2(s) 
+ Vi(x)Vi(s) + --- + 


For definiteness it may be assumed that U;(x) contains no term in 
sin kx, while V;.(x) contains sin kx with a nonvanishing coefficient; the 
function K,(x, s) would be unchanged if U;, Vi were replaced by any 
equivalent pair of sums of the kth order. If T,,(x) is any trigonometric 
sum of the uth order, 


Suppose p(x) =0 throughout an interval (a, B). Let x1, x2 be any two 
distinct points of (a, 8). Let t,4(x) be an arbitrary trigonometric 
sum of order n—1 at most, and let sin 3(x—x,) sin 3(x—%2)T,1(x) 
=T,(x); the product 


sin — x) sin}(”% — x2) = }.cos4(x, — +2) — }.cos [x — 3(x1 + 


1 Presented to the Society, December 29, 1939. 

2 See, e.g., G. Szegé, Orthogonal Polynomials, American Mathematical Society 
Colloquium Publications, vol. 23, New York, 1939, p. 39. 

3 See D. Jackson, Orthogonal polynomials on plane curves, Duke Mathematical 
Journal, vol. 3 (1937), pp. 228-236. 
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is a trigonometric sum of the first order. Then 


(1) f s) sin — 2%) sin 3(s — x2)rn_1(s)ds 
= sin — x) sin — 


The right-hand member vanishes for x =x, and for x=x2; that is to 
say, K,(x1, s) and K,(x2, s) as trigonometric sums of the nth order in s 
are orthogonal to every sum Ta-r(s) of lower order, with respect to 
sin 3(s—x1) sin 3(s—x2)p(s) as weight function. This weight function is 
of constant sign, since p(s) vanishes throughout the interval (x1, x2). 

The sums K,(x1, s) and K,(x2, s) are linearly independent. An iden- 
tity K,(xe, s)=CK,(x, s) would require in particular that Up (xe) 
=CUo(x1), Vilxe)=CVi(x1), that is, that C=1, 
COS X2=COS X1, SiN x2=sin 

In the limiting case x2=%1, 


f p(s)K,(x, s) sin? 3(s — = sin? 3(% — 
and by differentiation with respect to x under the sign of integration 


fro s) sin? $(s — 


= sin 3(x — cos 3(% — x1)tn_1(x) + sin? — 


where K,,(x, s) means (0/0x)K,(x, s). Both right-hand members van- 
ish for x=%1; K,(x, s) and K,:(x1, s) as trigonometric sums of the 
nth order in s are orthogonal to every sum of lower order for weight 
sin? 3(s—x,)p(s). Here no assumption with regard to vanishing of p 
is required. It is obvious that K, and K,,: are linearly independent, 
for one contains a constant term and the other does not. 

Vanishing of p is again unnecessary if x; and x2 are allowed to be 
complex. Let x.=y+26, x2=y—16, with y and 6 real, 


sin 3(x — x) sin 3(x — x2) = 4 cos i6 — 3 cos (x — y) 


| 


cosh 6 — 3 cos (x — y). 


‘Since each member of (1) represents a trigonometric sum in x, and 
they are identical for real values of x, they are equal also when x is 
complex. For x =x, the right-hand member is zero, and the real and 
pure imaginary parts of the expression on the left must vanish sepa- 
rately. The same result is obtained by setting x =x. The real and pure 
imaginary parts of K,(x1, s) for real s (or of its conjugate K,,(x2, s)) are 
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orthogonal to every trigonometric sum of lower order in s with respect 
to the non-negative weight function [3 cosh 5—} cos (s—v7) ]p(s). 
These real and pure imaginary components, being linearly independ- 
ent combinations of the linearly independent functions K,(x1, s) and 
K,,(x2, s), are themselves linearly independent trigonometric sums of 
the mth order in s. 


3. Polynomials on an algebraic curve. Orthogonal trigonometric 
sums can be regarded as a special instance of orthogonal polynomials 
in two real variables on an algebraic curve, the curve in this case being 
a circle (or, without essentially greater generality, an ellipse). Con- 
siderations analogous to those outlined above apply to the more gen- 
eral problem. As an illustration the reasoning will be formulated for a 
curve of the third degree. It will be immediately apparent that the 
same type of argument can be applied to curves of higher degree, if 
exceptional cases are avoided, or, on the other hand, to an alternative 
treatment of orthogonal polynomials on a circle, or to orthogonal 
polynomials on hyperbolic arcs. (Orthogonal polynomials on a parab- 
ola are essentially orthogonal polynomials in a single variable.) 

Let C be a curve of the third degree. Let p(x, y) be a non-negative 
weight function defined on C, and, for simplicity, vanishing identi- 
cally except on an arc or arcs of finite extent. Let poi(x, y), pu(x, ¥), 
Pu(x, y), and, for pri(x, vy), ¥), Pns(x, y) denote the mem- 
bers of a corresponding system of orthonormal polynomials on C, each 
polynomial being of total degree indicated by its first subscript. Let 


kSn 


the summation being extended over all the relevant pairs of sub- 
scripts. As in the special case of trigonometric sums, the function K, 
is completely determinate when the points (x, y) and (u, v) are on the 
curve C in their respective planes, although the individual polyno- 
mials of any specified degree in the system admit orthogonal trans- 
formation among themselves. 

Let ax+by+c=0 be the equation of a line intersecting C in three 
distinct points (x1, yi), (x2, ye), (xs, ys), and so situated that the do- 
main of orthogonality of the polynomials is all on one side of it, that 
is, so that ax+by-+c is of constant sign where p+0. The intersections 
may be real, or two of them may be conjugate complex; it is under- 
stood that the coefficients a, 5, c, as well as those in the equation 
of C, are real. It is to be understood further, if C is degenerate, that 
the domain of orthogonality includes a portion of positive measure 
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(with respect to arc length as variable of integration) on each of the 
loci defined by the vanishing of an irreducible factor in the equation 
of C. 

If P,(x, y) is any polynomial of the mth or lower degree, then for 
(x,y) on C 


f p(u, v)K,(x, y, u, v)P,(u, v)ds = P,(x, y), 
c 


where s denotes arc length on the curve C in the (u, v) plane. While 
the identity holds in general only on the curve, it is valid at all points 
of the curve, not merely where p <0. If 1,_1(x, y) is an arbitrary poly- 
nomial of degree »—1 at most, and P,(x, y) =(ax+by+c)m,1(x, y), 
the identity states that 


(2) f p(u, v)Kn(x, y, v)(au + bv + v)ds 
= (ax + by + c)mn-i(x, y). 


Since the points (x,, y,), »=1, 2, 3, lie on the curve and at the same 
time on the line ax +by+c=0, the relation holds with the right-hand 
member equal to zero if any one of these pairs of values is substituted 
for (x, y). The functions K,(x,, y,, u, v) as polynomials in u and v 
are orthogonal to every polynomial of lower degree for weight 
(au-+bv+c)p(u, v). 

The three polynomials K,(x,, y,, u, v) are linearly independent on C 
if n=2. For, as the polynomials ;,;(u, v) are linearly independ- 
ent on the curve, a relation of linear dependence of the form 
u, =0 would imply corresponding relations of linear 
dependence among the quantities p;;(x,, y,) as coefficients in the K’s. 
In particular, the first six of the equations >-,C,p:;(x,, y») would re- 
quire that 


Ci+C2+C; =0, 

Cixi + Coxe + = 0, 

Ciyi + Cove + Csys = 0, 

+ Coxs + = 0, 
Cixiyi: + Coxeye + Caxsys = 0, 
Ciyi + Coys + Cays = 0. 


The first, second, and fourth of these equations are possible with C’s 
not all zero only if two of the x’s are equal, and the first, third, and 
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sixth only if two of the y’s are equal. But this is inconsistent with the 
hypothesis that (x1, 41), (x2, y2), (x3, ys) are three distinct points on 
a straight line. Similarly, though more simply, any two of the poly- 
nomials K,(x,, y,, u, v) are linearly independent on C. The expressions 
K,,(x,, ¥», U, v) give the complete system of orthogonal polynomials 
corresponding to the weight function (ax+by+c)p(x, y). If two of the 
points (x,, y,) are conjugate complex, real polynomials for the orthog- 
onal system can be obtained by taking the real and pure imaginary 
parts of the corresponding K’s. 

Similar reasoning is clearly effective for a curve of the Nth degree 
with a straight line meeting it in N distinct points. For a treatment 
of certain cases of coincident points of intersection the discussion will 
be limited to the curve of the third degree. 

Suppose now that the line ax+by+c=0, instead of meeting the 
curve in three distinct points, is tangent to it at an ordinary point 
(x1, ¥1), and intersects it at a point (x2, ye) distinct from (x, 9:1). It is 
still assumed that the part of the curve on which p0 is all on one 
side of the line, that is, that ax+by+<c is of constant sign there. At 
least one of the coefficients a, b is different from zero; without loss 
of generality it may be supposed that 50, that is, that the line 
is not parallel to the y-axis. In the neighborhood of the point (x1, 4:1), 
y is a single-valued function of x on the curve, with a derivative whose 
value reduces at (x:, 1) to (—a/b), the slope of the tangent line. In 
this neighborhood on C, by differentiation of (2), 


ff v)Kaz(x, y, v)(au + bv + v)ds 
c 
d 
= — [(ax + by + 9) J 
dx 
d 
(ax by + c) y) (a + by’)n-1(x, y), 


where K,,.(x, y, u, v) means the total derivative of K, with respect 
to x, equal to 0K,/dx+~y’0K,/dy. At (x1, y1) both ax+by+c and 
a+by’ vanish. The function K,2(x:, y1, u, v), which is still a polyno- 
mial as regards its dependence on u and 2, is orthogonal to 7, 1(u, v) 
for weight (au+bv+c)p(u, v). 

There have been obtained then for 22 three polynomials 
K,.(x1, v1, u, 0), Knz(x1, y1, U, v), Kn(x2, ye, u, v), each possessing the 
property of orthogonality. These are once more linearly independent 
on the curve. Without effect on the form of K,, and so without any 
impairment of generality, it may be assumed that the first six poly- 
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nomials in the orthogonal system are specifically those obtained by 
applying the Schmidt process to the monomials 1, x, y, x?, xy, y?, taken 
in that order. Then pu (x, y) does not involve y, and x(x, y) involves 
neither xy nor y?. Suppose there were an identity of the form 


CiK, (x1, U, 2) + C2K,:(%1, 41, U, + C3K,(%2, Y2, U, 2) =0 
with C’s not all zero. Then 
Cipei( x1, 91) + Caper, 91) + = 0 


for each pair of indices (k, j) in which kn, the accent indicating 
total differentiation with respect to x. The equations for the first four 
pairs of indices may be simplified to the form 


Ci + C3 = 0, 

Cix1 + C2 + Coxe = 0, 
C2a/b + Csye = 0, 
Cixi + + Coxe = 0. 


The first, second, and fourth of these equations require that x:=%, 
which is impossible under the hypothesis that (x1, yi) and (x2, ye) are 
distinct and 60. It is immediately apparent that any two of the 
three K’s are linearly independent for »=1. So the K’s give the com- 
plete orthogonal system for weight (ax+by+c)p(x, y). 

Finally, let the line meet the curve at a double point (x1, yi) at 
which there are distinct tangents with finite slopes, and at another 
point (x2, ye). It is assumed, as always, that ax+by-+c is of constant 
sign where p does not vanish. Let Ki(u, v) and K2(u, v) be the total 
derivatives K,:(x1, yi, u, v) calculated respectively for the two 
branches of the curve through (x, 1); let the corresponding slopes 
be A; and Az. Then 


Cc 


(a + bry) (41, = 1, 2. 


It is seen that (a+b\:2)Ki(u, v) —(a+bd1)K2(u, v) is orthogonal to 
Tn-1(u, v) for the composite weight function. This polynomial and the 
two polynomials K,,(x1, 0), Kn(x2, y2, u, v) are found to be linearly 
independent on C for n=2, and once more a complete orthogonal sys- 
tem is obtained. 

These illustrations will be allowed to suffice for the case of multiple 
intersections. 
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4. Other cases. The corresponding theory for a non-algebraic curve 
is complicated by the fact that the number of polynomials of the mth 
degree in the orthogonal system increases with m, but is compen- 
satingly simplified by the observation that the representation corre- 
sponding to (2) holds for all values of x and y, so that (except for the 
assumption that the domain of orthogonality is all on one side of the 
line) the points where the straight line meets the curve are no longer a 
matter of special concern. If (x,, y,) are any n+1 distinct points on 
the line, independent polynomials of the mth degree orthogonal to 
every polynomial of lower degree with respect to the composite 
weight function are given by K,(x,, y,, u, v). 

A similar conclusion holds for orthogonality on a two-dimensional 
region. 
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NOTE ON AN INEQUALITY OF STEINER! 
T. RADO AND P. REICHELDERFER 


Let Q denote the unit square 0<x, y <1. If f(x, y) be any function 
defined and continuous on Q, the relation z=f(x, y) yields a continu- 
ous surface defined over Q. The Lebesgue area? of this surface will be 
denoted by L(f). Let z=fi(x, y), z=fe(x, y) be two continuous surfaces 
defined over Q; then clearly z= [f,(x, y) +f2(x, y)]/2 is a continuous 
surface defined over Q. The imequality of Steiner® states that 
L([fitfe]/2) [L(ft) +L |/2. McShane‘ obtained interesting and 
important results concerning the situation where the sign of equality 
holds in this relation. In this note we improve his results and, in a 
sense, give them a final form. 

In order to emphasize and to clarify what is significant and inter- 
esting in the results of McShane and in our improvements thereon, 
we remind our reader of a few facts concerning the Lebesgue area.® 
Given a continuous surface z=f(x, y) defined over Q; if L(f) is 
finite then the partial derivatives f, and f, exist almost everywhere 
in Q, the integral [fo[1+/2+/3]"*dxdy exists, and the relation 


1 Presented to the Society, April 13, 1940. 

2 See S. Saks, Theory of the Integral, Warsaw and Lwéw, 1937, chap. 5, for the 
facts used in this paper concerning the Lebesgue area. 

3 E. J. McShane, On a certain inequality of Steiner, Annals of Mathematics, (2), 
vol. 33 (1932), pp. 125-138. 

* Loc. cit.* 

§ 
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Sfelttfi+fz]dxdy SL(f) is valid. The sign of equality holds in 
this last relation if and only if f(x, y) is absolutely continuous in the 
sense of Tonelli. Next, if a sequence of continuous functions f,(x, y) 
converges uniformly on Q to f(x, y), then it is always true that 
lim inf, ... L(f,) =>L(f), but the sign of equality holds in this last rela- 
tion only if the sequence f,(x, y) is chosen with extreme care. 

Suppose z=f,(x, y), z= f2(x, y) are two continuous surfaces defined 
over Q such that L([fi:+f2]/2) = +L (fz) |/2. If we assume, for 
the moment, that both fi(x, y) and f2(x, y) are absolutely continuous 
in the sense of Tonelli, then so is [fi(x, y) +f2(x, y) ]/2, and the preced- 
ing equality is equivalent to 


[1+ fis + fis) "+ [1 + feet 
+ fis] +f 


(1) 


Using the inequality of Schwartz one sees that the integrand in (1) 
is never negative; thus the integrand is equal to zero almost every- 
where in Q, which implies that fi:=fe:, fiy=fey almost everywhere 
in Q—that is, [(fiz—fer)? + (fiy —fey)?]!/2=0 almost everywhere in 

Now suppose we discard the assumptions that f; and fz are abso- 
lutely continuous in the sense of Tonelli; then, of course, the pre- 
ceding reasoning yields no result. Still McShane proves that 
[ (fiz almost everywhere in Q. Indeed, he 
proves a sharper result to the effect that, when [L(fi)+L(f2)]/2 
—L([fitfe]/2) is small, then [(fiz—fer)?+(fiy—fey)?]"? is small in 
measure, so to speak—dquite exactly, he establishes a contrapositive 
statement :7 If the functions f(x, y), fo(x, y) are defined and continuous 
on a square Q and there exist positive numbers G, €, 5 such that 

(1) Lift) $G, Life) $G; 

(2) [(fiz—fez)?+(fiy—fey)?]/22=€ on a set of measure at least equal 
to 5; 
then 


L([fi + fe]/2) [L(ft) + L(fz)]/2 — uG, «, 8), 


where pu ts a positive number whose value depends only on G, e, and 6. 
We improve this result by replacing smallness with respect to 


6 From this it follows (loc. cit.,3 Theorem I) that fi—f2 is constant on Q, but we 
shall not be concerned with this fact. 
7 Loc. cit.,? Theorem IV. 
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measure by smallness with respect to exponent. First, we show that, 


af fi(x, y) and fo(x, y) are two continuous functions defined on the square 
Q for which L(f,) and L(fz) are both finite, then 


{ f J, — fas)? + (fy fu" 


(2) 


Since the Hélder inequality reveals 


{ [(fiz — fos)? + (fy — 


to be a non-decreasing function of X, it follows that, for every expo- 
nent A satisfying 0<A 1/2, it is true that 


SS. [(fis — fos)? + (fv — fu)? 2p 


L(fs) + _ 


(3) 
< (Lif) + : 


Next, we prove that, under the same assumptions, it follows that 


SS, [(fis — fos)? + — fu)? 2/r 


(4) 
< (Lif) + 


for every exponent d satisfying 1/2 =\ <1. The reader will observe that 
inequality (4) reduces to (2) for \=1/2. 

Let, now, f(x, y) be a sequence of continuous functions converging 
uniformly on Q to f(x, y); assume that L(f,) and L(f) are finite, and 
that L(f,) converges to L(f). Under these hypotheses McShane proves 
that [(f.2:—f2)?+(fay—fy)*]/2 converges to zero in measure. We im- 
prove this result by showing that under these hypotheses it is true that 


no 


(5) f [faz — + — = 0 


for every exponent d satisfying 0<X<1. Since McShane has shown 


8 Loc. cit.,3 Theorem V. 
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that the relation (5) is not generally true for the exponent \ = 1,° our 
result is the best obtainable under the given hypotheses. 

In fact, we shall give two proofs for relation (5). One proof will 
reveal this result as an immediate consequence of inequalities (3) and 
(4); the other will depend upon a general lemma concerning the Le- 
besgue integral. 

We proceed presently to the proof of inequality (2). Given two 
continuous functions fi(x, y), fe(x, y) defined on Q for which L(f,), 
L(fz) are both finite; then fiz, fiy; fez, fey exist almost everywhere in Q. 
Set 

(fiz fez)? + (fiy fay)? v=1 + fizfez + fiyfey; 


1/2 1/2 


w= [1+ fist fel: 
w= [1+ ([fiz + + + fev) 
a=(w+ w2)/2—w; B= (wit we)/2+ wv; 


Y = WiWe — 56 = wywe + »v. 


The reader will easily verify the following identities: 

(6) af=y/2, 

(7) y+ 6 = yb =u + (fisfey — fasfiy)*. 

Now (7) implies that y 20, 520, whence it follows that a-820, and 


that 
5 S 2wyw. S (wi + we)?/2. 


In view of these relations (6) implies that a 20, 8 20; hence 6 Sw+ 2. 
So the reader sees that 


(8) u? y5 = < (wi + w2)*{(wi + w2)/2 — w}. 

As we observed above, it is true that 

(9) ff wdxdy < L(fi), = i: Zi 
Q 


A reasoning of McShane!® shows that 


(10) Sf {(wi + we)/2 — w}dxdy 
< [L(f) + L(fs)]/2 — L( [fi + fe]/2)- 


Using the Hélder inequality the reader will speedily verify that in- 
equality (2) follows from (8), (9), and (10). 


® Loc. cit.,3 Theorem VI. 
10 Loc. cit.,3 p. 129. 
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We proceed next to establish inequality (4). It follows by the tri- 
angle inequality that u <w:+w»; hence we have from (9), 


Q 


Suppose that i: is a positive number and F(x, y) is a non-negative 
function such that F* is summable on Q. If \; be any positive number 
less than Ag, it follows by the Hélder inequality that F™ is also sum- 
mable on Q. Now every number J satisfying \; S\ S)-2 may be written 
in the form where 0Sa1, a2<1, a1 +a2=1. From the 
Hdlder inequality we have 


The reader will verify that (4) follows from (2), (11), and (12) with 

Now let f,(x, y) be a sequence of continuous functions converging 
uniformly on Q to f(x, y); assume that L(f,) and L(f) are finite, and 
that L(f,) converges to L(f). We first show that 


+ L 
2 2 
Since (f.+f)/2 converges uniformly to f on Q, it follows that 
(14) lim inf L (4~) > Lif). 


But lim, .. [L(f,) +L(f) ]/2=L(f), and by the inequality of Steiner it 
follows that L([f.+f]/2) [L(fn) +L(f) ]/2. Thus 


(15) lim < Lif). 


n— 2 


From (14) and (15), the assertion in (13) follows. 
Now the inequality (3) shows that, for 0<A 31/2, it is true that 


= +1 n +L d/2 


(16) 


while inequality (4) shows that, for 1/2 <\ 1, it is true that 
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(17) 


LU) +L (1-a)/2 


From (13), (16), (17) the relation (5) clearly follows. 

In order to give a second proof of relation (5) we use the following 
lemma.'! Let F,(x, y) be a sequence of non-negative functions defined 
on Q for which there exists a positive number \o such that (1) the functions 
F,, converge to zero in measure; (2) the functions F® are summable on Q; 
(3) the integrals {{gF}*dxdy are uniformly bounded by a positive number 
M. Then, for every positive number d less than Xo, tt is true that 
limn JfoFrdxdy =0. 

Proor. Given e€>0, since F, converges to zero in measure there 
exists a positive integer m(¢) such that, for every  >n(e) the set E,(€) 
of points (x, y) in Q for which F,(x, y) 2” is of measure less than 
. | Now obviously 


(18) f F.dxdy = f f Fedxdy + f f Fedxdy. 
Q Q—E,, (€) 


On Q—E,(e) it is clear that Fi(x, y) Se; hence 


(19) ff F.dxdy Se. 
Q-E,,(e) 


Using the Hdélder inequality, we have, for n>n(e), 


f Fedxdy 
E,(¢) 
Ao /Ao 
= Sf F, | ad | < 
E,,(€) E,(e) 


Since ¢€ is arbitrary, the lemma follows immediately from (18), (19), 
(20). 

Now let f,(x, y) be a sequence of continuous functions converging 
uniformly on Q to a function f(x, y); assume that L(f,) and L(f) are 
finite, and that lim,... L(f,) =Z(f). Consider the sequence of functions 
vy) = [(fnz—f2)? + (fny—fy)? In view of the results of McShane 

11 This lemma is an immediate corollary of certain important results of F. Riesz, 


Untersuchungen iiber Systeme integrierbarer Funktionen, Mathematische Annalen, vol. 
69 (1910), pp. 449-497. We give a direct proof for the convenience of the reader. 


(20) 


= 
= 
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stated above, it is clear that this sequence of functions satisfies the 
three conditions of the preceding lemma with \y)=1. Thus, for every 
exponent J satisfying 0<\ <1 it is true that 


lim f — + — fe)? = 0. 


So we have a second proof of our result (5). 


THE STATE UNIVERSITY 


AN ADDITIONAL CRITERION FOR THE FIRST CASE 
OF FERMAT’S LAST THEOREM! 


BARKLEY ROSSER 
In an earlier paper? it was shown that if p is an odd prime and 
a? + 6? +c? =0 
has a solution in.integers prime to p, then 
m?~! = 1 (mod p?) 


for each prime m £41. In this paper the result is extended to m $43. 

We will use the notations and conventions of I throughout, and a 
reference to a numbered equation will refer to the equation of that 
number in I. With » assumed to be an odd prime such that (1) has 
a solution in integers prime to p, we assume that a ¢ exists such 
that the values of (2) satisfy (4), (5), and (6) with m=43. Put 
g(x) =f(x)f(—x) and 


h(x) = (x4? — 1)/(x® — 1). 


Then g(x) divides h(x), and g(x) can be completely factored modulo p. 
Case 1. Assume that a root of g(x) is a root of 


h(x) /(x!? + x8 + + xt + x? + 1). 


Then this root belongs to either the exponent 21 or the exponent 42 
modulo p. Hence p=1 (mod 42). So there is an w such that 


w+towt1i=0. 


Then g(x), g(wx), and g(w*x) all divide h(x). Moreover, the only cases 
in which two of g(x), g(wx), and g(w?x) have a common factor are 
I. a§+1=0, 
II. a6+a?+3a?+3a+1=0, 
III. a®’—a*—3a?—3a—1=0, 
or cases derived from these by replacing a by one of the other roots 
of f(x). So if we show that A(x) has no factor in common with any of 
084-1, x86 or x®—x?—3x?—3x—1, then we can con- 
clude that g(x)g(wx)g(w*x) must divide h(x). 
Clearly h(x) has no factor in common with x*+1. 


1 Presented to the Society, April 27, 1940. 
2 A new lower bound for the exponent in the first case of Fermat's last theorem, this 
Bulletin, vol. 46 (1940), pp. 299-304. This paper will be referred to as I. 
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Suppose A(x) has a factor in common with x*+2'+3x?+3x+1. 
This latter has the factors x?-+x+1 and x*—x*?+2x+1. The first has 
no factor in common with h(x), since it divides x*—1, which has no 
factor in common with h(x). To test the second, we try it successively 
with each of the four factors of h(x), getting the eliminants 


13-19?- 127-1632, 5-36913, 2-127, 5-7. 
Suppose h(x) has a factor in common with x* —x* —3x?—3x—1. This 
latter has the factors x?—x—1 and x*+x°+2x?+2x-+1. The first has 
no factor in common with h(x) by Lemma 3 of I. Trying the second 


factor successively with each of the four factors of h(x), we get the 
eliminants 


73-43, 2?-7-13-43, 7, 43. 


So g(x)g(wx)g(w*x) must divide h(x). Since both are of degree 36, 
they must be equal. Putting b=c+5 and equating coefficients, we get 


A +1 = 2c? + 3c? — 24c + 13 = 1, 
B+i1 = &+ + 42c* + — 9c? — + 173 = 1, 
C+i1= — + 171c* + 132c? — 666c? + 132c + 201 = 1. 
Dividing 16B and 8C by A, we get the remainder 
43D = 43(99c? + 192¢c — 116) =0 


from each. Then 
2cE = 29A + 3D = 2c(29c? + 192c — 60) = 0. 
As c=0 would give A =12=0, we have 
28cF = 15D — 29E = 28c(23c — 96) = 0, 
29cG = 8E — 5F = 29c(8&c — 49) = 0, 
8F — 23G = 359 = 0. 
Case 2. Assume that no root of g(x) is a root of 
h(x) /(x'* +- x! + x8 + + x4 + + 1). 
Then, since g(x) divides h(x) and is of degree 12, 
So 2c+1=1 and c?+5=1. 


PRINCETON UNIVERSITY 


SOME NEW PROPERTIES OF TRANSFINITE ORDINALS! 
SEYMOUR SHERMAN 


1. Introduction. Sierpinski? has stated the following theorem on the 
Cantor normal form: 


Every ordinal A >0 can be represented in the form 


n—1 


A= 
i=0 
where 
o> N, Qo, 21, > 0, 


>an12 0. 


Henceforth in this paper, when an ordinal is written in summation 
form, it will be assumed that the summation satisfies the above-stated 
requirements. 

Elsewhere Sierpifiski? has proved: 


The set of all “divisors on the left” of an ordinal number is closed.‘ 


One of the purposes of this paper is to analyze further, by means of 
the Cantor normal form, the left and right factors of transfinite ordi- 
nals. In addition the Cantor normal form will be used to prove that 
for ordinals A, B, and C 


(A + BYC Ss AC + BC. 


We can also specify the necessary and sufficient condition that the 
equality hold. F. Siecza® has used the Cantor normal form to discuss 


1 Presented to the Society, December 27, 1939. The author is indebted to Professor 
W. A. Hurwitz for his indispensable advice in the preparation of this paper. 

2 W. Sierpifiski, Lecons sur les Nombres Transfinis, Paris, 1928, p. 202; hereafter 
referred to as Sierpinski (I). The theorem was first stated for ordinals of the first and 
second ordinal class by G. Cantor, Beitraége zur Begriindung der transfiniten Mengen- 
lehre, Mathematische Annalen, vol. 49 (1897), p. 237, and for all ordinals by G. Hes- 
senberg, Grundbegriffe der Mengenlehre, Abhandlungen der Fries’schen Schule, New 
Series 1.4, Géttingen, 1906, p. 587. 

3 W. Sierpifiski, A property of ordinal numbers, Bulletin of the Calcutta Mathe- 
matical Society, vol. 20 (1930), pp. 21-22; hereafter referred to as Sierpifiski (II). 

4 A set of ordinal numbers is said to be closed if it contains the least upper bound 
of each subset. 

5 F, Siecza, Sur l’unicité de la décomposition de nombres ordinaux en facteurs trre- 
ductibles, Fundamenta Mathematicae, vol. 5 (1924), pp. 172-175. 
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factorization of transfinite ordinals, but his problem was different 
from the one considered here. 

2. Left and right factors. We shall now consider left factors. 


2.1. THEOREM. A necessary and sufficient condition that x be a 
left factor of wa; ts that either or x=*ip; 
+075) where p; is a factor of a;and OS jsn—1. 


If 0<x<w*"!, let x Thus Bo<an-1 and —Bo+a;, 
0<j<n-—1, exists.* We have 


m—1 n—1 a-l 
| = > wai, 
i=0 i=0 i=0 


and hence x is a left factor of A. 
If wa;, where a;=p,\;, then 


n—1 7-1 
i=0 


t=j+1 


j-1 n—1 


i=0 i=j+1 
71 n—1 

= >, + w%ia; + wa; = > wid; 
i=0 i=j+1 i=0 


and x isa left factor of A. 
Thus we have proved the sufficiency of the condition. 
Suppose that x =)°7'w%b; is a left factor of A and that x2w*"". 


Then 
m—1 
Ac ( 2. WC; ). 
7=0 i=0 


m—1 


i=0 i=1 


If y:1=0 then 


n—1 
i=0 


where the last expression is a Cantor normal form. Comparing coeffi- 
cients in the right and left ends of the series of equations, we see that 


6 See F. Hausdorff, Mengenlehre, 1935, p. 63. 


1941] TRANSFINITE ORDINALS 113 


for some j, 0Sj<n—1, Bo=aj, bo is a factor of a,, and 
If y:-1>0, then 


n—1 m—1 l-1 I-1 
i=0 i=0 i=0 i=0 

But since x 2w**-', we have By 2an_1, Bo+Y1-1>Qn-1, and thus the 
left and right ends of series of equations could not be equal. From this 
contradiction y;-1=0 and by the preceding argument our condition is 
necessary. 

Since in }-?>)'wia; there is only a finite set of coefficients and since 
each of these coefficients, being finite, has only a finite set of factors, 
there is but a finite set of left factors of the form wip; +) 75} wai, 
p; a factor of a;; that is, there is only a finite set of left factors greater 
than or equal to w**-'. Hence we have proved: 


2.2. CoroLiary. The set of left factors of A =)_"=w a; consists of 
the set of all positive ordinals less than w**— and a finite set of ordinals 
not less than w**-!, 


2.3. CoroLiary. If A => the left factors of A, in order 
of magnitude, form a normally ordered set whose ordinal number is 
7(a,), where r(a;) is the number of factors of a;. 


2.31. COROLLARY. The ordinal number of the normally ordered set of 
left factors of a positive ordinal is either a finite ordinal or the sum of an 
additively indecomposable’ ordinal and a finite ordinal. 


We can now obtain the previously stated result of Sierpifski,* who 
makes no use of the Cantor normal form: 


2.4. COROLLARY. The left factors of A form a closed set. 


The following corollary has been proved by Hausdorff® and Ben- 
nett!® by means of the euclidean algorithm. 


2.5. COROLLARY. Any two ordinals greater than 0 have a greatest com- 
mon left factor. 


We shall prove this by means of the preceding corollary. Let A >0 


7 An ordinal a>0 is said to be additively indecomposable when there do not exist 
ordinals 8, y such that B<a, y<a, and a=8+7. See Sierpifiski (I), p. 178. 

8 Sierpifiski (II). 

® F. Hausdorff, Grundziige der Mengenlehre, Leipzig, 1914. 

10 A. A. Bennett, Some arithmetic operations with transfinite ordinals, American 
Mathematical Monthly, vol. 28 (1921), pp. 427-430. 
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have a set of left factors, &%, and B>0O have a set of left factors 8. 
AB, the intersection of % and %, has at least one element, 1. The least 
upper bound, c, of 4B is a member of Y% and a member of B, since 
both % and $ are closed. Hence ¢c is a member of AB; that is, AB has 
a maximum element. Thus ¢ is the greatest common left factor of A 
and B. 

The development in case of right factors is somewhat analogous. 
The following can easily be seen to be true. 

2.6. THEOREM. If A =>-?)w%a;, then a necessary and sufficient con- 
dition that x be a right factor of A is that either x=) 779 (w-8°t%)aj, 
where Bo <Qn-1, or x=) J=)(w~%*™)a;+r;, where r; is a factor of aj. 

Since each exponent of the Cantor normal form has a finite set of 


residues and each coefficient has only a finite set of factors, we can 
state the following :"! 


2.7. THEOREM. The set of all right factors of any positive ordinal is 
finite. 


Sierpifiski has proved some of these theorems by using merely the 
properties of additively indecomposable ordinals. The use of the 
Cantor normal form not only yields more extensive results in the case 
of left factors, but also provides a unified approach to other problems 
in the theory of transfinite ordinals. 


3. The distributive laws. Among transfinite ordinals the distribu- 
tive law is always valid for one order of multiplication 


C(A + B) =CA+CB, 
but not always for the other order of multiplication 
(A + B)C ? AC + BC. 
We shall prove: 
3.1. THEOREM. (A+B)CSAC+BC. 


If either A, B, or C is zero, then our theorem holds. Consider A >0, 
B>9, and C>0, where 
n—1 m--1 


fits w%ia; = way + A’, B= wih; = wh, + B’, 


i=0 i=0 


I-1 
C= 
i=0 


" Sierpinski (1). 
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C is either finite, transfinite and limiting, or transfinite and non- 
limiting. 
Suppose C is finite and 0<C<w. If a9 <p, then 
A+ B=A + why + B’ = B, (A + B)C = BC = AC + BC. 
If a9 =Bo, then 
A + B = w(ag + bo) + B’ 
and 
AC + BC = wa + A’ + wbhoC + B’ 
= w(do + bo)C + B’ = (A + BIC. 
If ao then 
A + B= way + A’ + + B’, 
and 
(A + B)C = + A’ + why + B’ = AC + BS AC+ BC. 
Thus if 0<C<w, then 
(A + B)C S AC + BC. 


Suppose C is transfinite and limiting, then y;1>0. Let J=max 
(A, B); then 


(A + Ss J+ = = J(2C) = JC AC + BC. 
Suppose C is transfinite and non-limiting, then y:_1=0, and if we 
take 
1-2 
Co = Ci = Cir, 
then 
Co + Ci = C. 
If ao =Bo, then 
(A + B)C 


(A + B)(Co + Ci) = (A + B)Cy + (A + BC, 
ACy + AC, + BC; < AC + BC. 


TAN 


If QA <Bo, then 
(A + B)C BC AC + BC. 
Thus if =0, then 
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(A + B)C Ss AC + BC, 


and our theorem is proved. 
We can also prove the following: 


3.2. COROLLARY. A necessary and sufficient condition that 
(A + B)C = AC + BC 
for positive A, B, and C is that either C=1, or 1<C<w and a<fp, 
or wSC and 


This corollary follows quite easily from the reasoning found in the 
preceding section. 


CoRNELL UNIVERSITY 


THE DECOMPOSITION THEOREM FOR 
ABELIAN GROUPS! 


JOEL BRENNER 


Let G be an abelian group such that p*g=0 for all gEG, p prime, 
k fixed. We prove G has a basis, that is, a set of elements such that 
each g€G is uniquely expressible as a linear combination of elements 
of the set.” 


THEOREM. There exists an ascending chain of sets B;, 0 <1 SR, of ele- 
ments of G with the properties: 

(i) Every element in B; is of order greater than p*~‘. 

(ii) The elements in B; are completely linearly independent. 

(iii) If the order of the element g in G is greater than p*~‘, then there 
exists a (unique) linear combination z of elements of B; such that the 
order of g—z is at most p*-‘. 


Since we may choose as By the vacuous set, we may assume that 
the sets Bo, - - - , B, have already been constructed in such a way as 
to meet the requirements (i) to (iii). In order to construct Bs41 we 
adjoin to B, any greatest subset C of G with the following properties. 

(a) All the elements in C are of order p*-*. 

(b) The join B,4; of the sets B, and C is an independent set. 


1 Presented to the Society, April 6, 1940. 

2 Unique in that the number of nonzero terms in an expression for g is unique and 
only the arrangement but not the respective values of the nonzero terms may differ 
in two expressions for g. 
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The set B,4; satisfies (i) and (ii). In order to prove (iii) let 4 be 
any element in G whose order is at least p*~*. If firstly the order of h 
is exactly p*~*, then it follows from the conditions (a) and (b) that 
there exists an integer m and a linear combination y =) a,y; of ele- 
ments in B,,: so that O#mh=y. Assume without loss of generality 
that m is a power of p<p*-*. Then (p*-*/m)mh=0=)_a,(p*-*/m)y;. 
By (ii), p*~* divides a;(p*—*/m),so that a;/m is an integer (all 7). There- 
fore the order of h—)}>(a;/m)y;is m < p*-*. If secondly the order of h is 
greater than p*~*, then there exists a linear combination z of elements 
in B, such that the order of h—z is at most p*~*, and thus (as above) 
y=)ocxy; can be found with y; in B,4: for which h—z—y has order 
<p*-*. We have shown’ that B; is a basis of G. 


UNIVERSITY OF CALIFORNIA 


3 The following contain proofs for finite groups: Andreas Speiser, Theorie der 
Gruppen von endlicher Ordnung, 3d edition, Berlin, Julius Springer, 1937, p. 46-49. 
R. Remak, Uber die Zerlegung der kommutativen Gruppen in zyklische teilerfremde 
Faktoren, Journal fiir die reine und angewandte Mathematik, vol. 141. L. C. Mathew- 
son, A simple proof of a theorem of Kronecher, ibid., vol. 161 (1929), p. 255. A. Korselt, 
ibid. N. Tschebotaroew, Bewies der Existenz einer Basis bei Abelschen Gruppen von 
endlicher Ordnung, Kasan University, Fizzico-Matematico Obchestvo Izviestiia, vol. 4 
(1929-1930). Third line after 5) in the proof read w=w,r for o=w,=r. In the fourth 
paragraph following, read 4,=qw,+t, OSt<w, A,---t---+. The following contain 
proofs for infinite groups: H. Pruefer, Untersuchungen ueber die Zerlegbarkeit, Mathe- 
matische Zeitschrift, vol. 17 (1923), p. 53; R. Baer, Compositio Mathematica, vol. 1 
(1934), p. 274. 
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ON THE CONVEX SOLUTION OF A CERTAIN 
FUNCTIONAL EQUATION! 


H. P. THIELMAN 


The object of this paper is to give a generalization of a result ob- 
tained in a recent article by Mayer.? His result is a particular case 
(for a=1, p=1, K=0) of the following theorem. 


THEOREM 1. The only function which is convex for all x>K 20, and 
satisfies the functional equation 


(1) 1/f(x + a) = xf(x), «>0,a>0,p>0, 
1s 

T'(x«/2a) 


The following proof follows closely that given by Mayer in the 
paper to which we referred above. All real functions occurring in this 
paper will be assumed to be defined for all positive values of the inde- 
pendent variable unless otherwise specified. A function f(x) is said to 
be convex for all x >K if for every pair of values of x, say x; and x2, 
for which f(x) is defined, and 1.>K, x.>K, 


(3) (EC) < + 


That f(x) satisfies the functional equation (1) follows from the fun- 
damental equation for the Gamma function, [(x+1) =xI (x). 
We shall now show that F(x) is convex. From (2) it follows? that 


F(x) (2Zav+ x)(2av+ a+ x) 


Since a>0, p>0, it follows that F’(x)/ F(x) is increasing with increas- 
ing x. Log F(x) is therefore a convex function, and therefore F(x), 
for the convexity of a function follows from the convexity of its loga- 
rithm as can be easily verified. 

Since F’(x) <0, F(x) is a’ decreasing function of x, and we see by (1) 
that 


' Presented to the Society, April 13, 1940. 
2 A. E. Mayer, Acta Mathematica, vol. 70 (1938), pp. 57-62. 
3 Mayer, loc. cit. 
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(4) > + a) > (x + a)~?. 


We can now prove the main part of our theorem: if f(x) is a function 
convex for x > K 20, and if it satisfies (1) for all x >0, then f(x) = F(x). 
From (1) it follows that the quotient Q(x) =f(x)/F(x) satisfies the 
functional equations 


(5) Q[x + (2n — 1)a] = 1/Q(z), 
(6) Q[x + 2na] = Q(z), 


where 1 is any positive integer. We must show that Q(x) =1. First 
we suppose that Q(x) >0 for some particular value of x. Then for that 
value of x by (4), (5), and (6) 


Fl[x + (2k — 1)a] 1 
Q(x) Q(x) [x + (2k — 2)a}/? 
Q(x) 
[x 2kal?/? 


Making use of the convexity of f(x) for x>K, we may write for 
x+(2n—1)a>K, 


(7) f(x + 2ma) S f[x + (2n — 1)a] + f[x + + 1)a]}, . 


and therefore 


fix + (2k — 1)a] = 


fla + 2ka] = Q(x)F(x + 2ka) > 


Q(x) 1 f 1 + 1 
(x + 2na)?!? ~ 20(x) \[x + (2m — 2)a]?/2 + 


0%) < 1 x + 2na 
(x 
2 (La + (2n — 2)a 


If Q(x) >0, then Q(x+a) =1/Q(x) >0, and in an exactly similar way 
we find that 


1 (fx + (2n+4+ \ 
+ a) 0%(a) < {|= 


Since the last two displayed equations are true for all values of n 
such that x+(2n—1)a>K, Q?(x)=1, and therefore | f(x) | = F(x). If 
we assume that there exists an x for which Q(x) <0, an argument simi- 
lar to the preceding one shows again that | f(x)| = F(x). Hence we con- 
clude that for all values of x>0 | f(x)| = F(x). We thus see, that, for 
every value of x, f(x) is equal to either plus or minus F(x). If f(x) were 


or 
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equal to minus F(x) for some particular value of x, say x=xo, then 
by (1) f(xo-+na) <0 for all m. Hence there would exist an x, say x, 
larger than K such that f(x) <0, and f(x:) = — F(x:). Then for every 
h>0 


But F(x) is decreasing, and therefore f(x,:+4) would be negative for 
all h>0O, and equal to — F(x1 +h). This function is, however, not con- 
vex. Hence f(x) can never be negative, and must be identical with 
F(x). 

A function is said to be concave for x > K if it satisfies equation (3) 
with the inequality sign reversed. It is obvious that the only function 
which is concave for x > K 20 and satisfies the functional equation (1) 
is — F(x). 

If the condition of convexity be replaced by requirement that f(x) 
be monotone for x > K 20, we can obtain the following result: 


THEOREM 2. The only function f(x) which is monotone, not increasing, 
for all x>K20, and satisfies the functional equation (1) is F(x). 


The proof of this theorem is analogous to that of Theorem 1, with 
the convexity condition (7) replaced by the monotone relation 


(8) f(% + 2na) < f[x + (2m — 1)a] S f[x + (2m — 2)a] 


(n sufficiently large). If x >0 is such that Q(x) >0, then from (8) it 
follows that Q?(x) =1. And if it is supposed that Q(x) <0 for some x, 
(8) leads to the contradictory inequalities Q?(x) >1, Q?(x) <1. Hence 
f(x) = F(x). The details of this proof are left to the reader. 

It is also obvious from the last theorem that the only function f(x) 
which is monotone, not decreasing, for all x >K 20, and satisfies (1) 
is — F(x). 


COLLEGE OF St. THOMAS 
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AN UNSYMMETRIC FUBINI THEOREM! 
R. H. CAMERON AND W. T. MARTIN 


Because of the great variety of ways in which repeated Stieltjes in- 
tegrals may occur, the ordinary Fubini theorem for Lebesgue integrals 
has more than one analogue in Lebesgue-Stieltjes (Radon) integrals.? 
One of these (the symmetric one) is well known.* However, there is 
another form which is not symmetric, and its proof does not seem to 
be in the literature, although it is known to many writers, in special 
cases at least.* This form does not appear to be immediately derivable 
from the symmetric form, and since it is of interest in various connec- 
tions, it seems worthwhile to give an explicit statement and proof. 
We do not follow the usual procedure of beginning with finite limits 
and then allowing the limits to become infinite because the passage to 
infinity seems to present difficulties of the same order as the direct 
proof of the final result itself. The immediate use of infinite limits is 
made possible by the fact that the symmetric Fubini theorem has al- 
ready been proved with infinite limits. 


THEOREM. Let k(x) be a function of bounded variation on every finite 
interval. Let p(x, u) be Borel measurable in (x, u); for almost all x with 
respect to k(x) let it be of bounded variation in u over every finite 
u-interval. Denote by V(x, u) the variation® V(x, u)= Seis| p(x, v)| : 
Assume that [*..V(x, u)| dk(x)| exists (is finite) for all’ u. Let s(u) be 
Borel measurable on (— ©, ~). Then the existence (finiteness) of either 


(1) f ove, u) | dk(x) | 


—20 


1 Presented to the Society, February 24, 1940. 

2 The integrals which occur in this paper are understood to be Lebesgue-Stieltjes 
(Radon) integrals. See, for example, Saks, Theory of the Integral, 2d revised edition, 
Warsaw-Lemberg, 1937, pp. 19 and 67. We note that the familiar principles of mono- 
tone and dominated convergence in the Lebesgue theory are also valid in this theory; 
see pages 28 and 29. In this paper we shall not admit + © as members of our num- 
ber system; that is, existence implies finiteness. 

3 Saks, loc. cit., p. 81. 

4 See for instance, N. Wiener and H. R. Pitt, On absolutely convergent Fourier- 
Stieltjes transforms, Duke Mathematical Journal, vol. 4 (1938), pp. 420-436. The 
reader will note that a form of this theorem was used in passing from line 13 to line 14 
on page 421. 

5 For negative u we understand that V(x, u)= -f 
strictly a variation only for positive u. 

6 Clearly it would be sufficient to require that the integral be finite for u ranging 
over some sequence having + ~ as limit points. 
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or 


implies the existence of the other, their equality, and the existence and 
equality of the following integrals 


(3) u)dk(x) = fae u). 


We shall first prove this theorem under the following additional as- 
sumptions which we shall later successively remove: 

(A) s(u) is a polygonal function (that is, a continuous function 
whose second derivative exists and is zero except at a finite number of 
points), 

(B) s(u) is bounded and vanishes identically outside a finite inter- 
val, 

(C) s(u) is non-negative, k(x) is monotonic increasing and p(x, u) 
is monotonic increasing and right continuous in u and p(x, 0) =0. 

Clearly the additional hypotheses (B) and (C) imply the existence 
of (1). Also they imply the existence of (2), as one sees in the following 
manner. Let be such that s(u) =0 when |u| >n—1 and such that 
s(u) is everywhere less than or equal to m. Then since V(x, u) = p(x, u) 
by (C) we see that 


(4) f s(u)dup(x, u) nf d.p(x, u) = n[V(x, n) — V(x, — n)]. 
Also 
(5) f V(x, u)dk(x) 
exists by hypothesis. Hence 

f ae(a) s(u)dup(x, u) nf dk(x)[V(x, n) — V(x, — n)]; 
that is, (2) also exists. 

Thus under the additional hypotheses (B) and (C) we need only 
show the equality of (1) and (2) since obviously the right and left 
members of (3) are equal to (1) and (2) respectively. In order to show 


the equality of (1) and (2) under (A), (B) and (C) we integrate by 
parts and use the fact that s vanishes at —m and at n. Thus 
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(6) u)dk(x) = — J ase u)dk(x). 


By the symmetric Fubini theorem, we obtain 


(7) asc u)dk(x) = Jv. u)ds(u). 


Integrating the inner integral of the right member of (7) by parts, 
we have 


(8) [a = - [a 


and chis with (6) and (7) establishes the equality of (1) and (2) and 
thus in view of the preceding remarks proves the theorem under the 
additional assumptions (A), (B) and (C). 

Now since every bounded Borel measurable function is a repeated 
limit of continuous (and even of polygonal) functions’ we shall show 
by the repeated use of the principle of dominated convergence that 
the theorem still holds when we drop (A) and keep only the addi- 
tional hypotheses (B) and (C). As we have noted, for this purpose 
it is again only necessary to prove the equality of (1) and (2). 
For purposes of induction let us assume that s*(u) =lim;.,, s;(u) 
(— © <u< where s;(u), s2(u), is a sequence of non-negative 
functions each of which is Borel measurable and vanishes when 
|u| >n—1 and each of which is everywhere less than or equal to n. 
Assuming that the conclusion of the theorem holds for each s;(u) when 
p and k satisfy (C), we shall show that it also holds for the limit func- 
tion s*(u). By bounded convergence 


lim f p(x, u)dk(x) = f f p(x, u)dk(x); 
—% —n —20 

and since (4) holds for each s;(u) it follows by dominated convergence 
that 


n 


lim u) = fae) lim ;(u)dup(x, 


—n 


= f axe) ut). 


7 See, for example, de la Vallée Poussin, Intégrales de Lebesgue (Borel Monograph), 
1916, pp. 36, 37. 
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Thus if the theorem holds under (B) and (C) for one Baire class it 
holds under (B) and (C) for the next; and hence the theorem holds 
under (B) and (C) for all Borel measurable functions. 

In order to remove (B) (retaining only (C)) let 


(9) 


s(u) if | <n-—1 and s(u) S72, 
s"(u) = { 
0 otherwise. 


Then since s*(u) satisfies (B) and (C), 


(10) u)dk(x) = fae u). 


Since either (1) or (2) exists, both these expressions are bounded 
(in 2). By monotone convergence 


lim $"(u)de pve. u)dk(x) = f saa. u)dk(x), 


now 


lim ak(x) u) = fae lim s"(u)dup(x, u) 


= f ut). 


Thus the theorem holds under (C). 
Finally, we can dispense with (C) and assume merely the original 
hypotheses of the theorem. To show this let 


s;(u) = max {(- 1) ‘s(2), 0}, i=0,1, 
1 

(11) k(x) = sth. | dk(&)| + (— + i= 0,1, 
pi(x, u) = LIV (x, u) + (- 1) ‘[ p(x, u+0) — p(x,0+ 0) ]}, 

4= 0,1. 


Obviously all these functions are Borel measurable and satisfy (C). 
We first show that the existence of (1) or (2) for s, p and k implies 
the existence of (1) or (2) respectively for s;, p;, ki. Since we always 
have 


when f(x) — g(x) is non-decreasing, and 
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f (x) | df(x)| = f “o(2)| df(x) | 


when = 0(x), it follows that 


f “V(x, u) | dk(x)| = f f u) | dk(x) | 


and 


> f dki(x) f | dup(x, 1) | 


Thus s;, p;, ki satisfy the hypotheses of the theorem, as well as (C), 
and hence 


(— f si(u)dy pila, u)dki(x) 


Adding these eight equations yields (3), and hence completes the 
proof of the theorem. 


THE MASSACHUSETTS INSTITUTE OF TECHNOLOGY 


LINEAR FORMS IN FUNCTION FIELDS' 
LEONARD TORNHEIM 


We shall prove algebraically an analogue for function fields? of a 
well known theorem of Minkowski on linear forms.* 


THEOREM 1. Let F be a field and z an indeterminate over F. Let 


n 
(1) Li = aisx;, i=1,---,n, 
j=l 
be n linear expressions with coefficients a;; in F(z) and with the determi- 
nant | ai;| of degree* d. Then for any set of n integers (1,-- +, Cn which 
satisfy the condition >-"_,cc>d—n there exists a set of values for 
%1,°*-,X, in F[z| and not all zero such that each L; has degree at most c;. 


First, we may assume that all of the c; are equal. For, suppose 
that cis the maximum of the c;. Write L/ for L,;z°-*:. The determinant 
of the coefficients of the L/ has degree d’=d+)_(c—c;) <)oc+n. If 
there is a set of values for x, - - - , x, with the property that the de- 
gree of each L/ is at most c, then these same values will make the 
degree of L; at most ¢;. 

Next, we may assume, after multiplying each L; by a suitable poly- 
nomial and by using an argument similar to that above, that all the 
a;;arein F[z]. 

We shall now convert our system of L; by means of a transforma- 
tion of determinant unity with elements in F[z] into an equivalent 
system having a;;=0 for i<j. Let b; be the g.c.d. of the ai;; then 
b,=)>_*_ ,a1,€, for appropriate cj in F[z]. Necessarily the cj are rela- 
tively prime. It is possible to find other quantities cj, (k=2,---, ) 
such that the determinant |cjx| has value unity.’ Thus the transfor- 


1 Presented to the Society, April 13, 1940. 

* See M. Deuring, Zur Theorie der Idealklassen in algebraischen Funktionenkérpern, 
Mathematische Annalen, vol. 106 (1932), pp. 103-106, for a related result. I believe 
the results I prove are new. 

3 A bibliography of both analytic and algebraic proofs of the theorem of Minkowski 
on linear forms is given by E. Jacobsthal, Der Minkowskische Linearformensatz, 
Sitzungsberichte Berliner mathematischen Gesellschaft, vol. 33 (1934), pp. 62-64. 
See also L. J. Mordell, Minkowski’s theorem on homogeneous linear forms, Journal of 
the London Mathematical Society, vol. 8 (1933), pp. 179-192. 

* The degree of a rational function is the degree of the numerator less that of the 
denominator. Zero is assigned the degree minus infinity. 

5A. A. Albert, Normalized integral bases of algebraic number fields 1, Annals of 
Mathematics, (2), vol. 38 (1937), p. 926 ff. The statement is proved for rational 
integral cj, but the proof applies to any integral domain having the property that a 
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mation x;=)_%-,¢yxf has determinant unity and hence it has a re- 
ciprocal transformation with elements in F[z]. The forms L; are 
transformed into L/ Here and, being a 
linear combination of a;, it is divisible by their g.c.d. bi; aj,=)ia,. 
The transformation 


n 
” = 


k=2 
of determinant unity transforms the L/ into L/’ with L{’ =)x/’. 

The procedure is repeated for the »—1 linear forms M;=)_7_.0'jx}’ 
(t=2,---, m). Finally, if this process is continued, the resultant 
transformation converts the original system (1) into one with a,;=0 
for i<j. As a consequence, if the degree of a;; is d;, then }-d;=d. By 
using another transformation of determinant unity we may assume 
that the degree of each a;; is at most d;. 

Let be the set of all m-tuples (s:, --- , 52) =s where the s; are 
in F[z] and have degree not greater than c; hence G; is a linear set 
over F whose order u;=(c+1). Write L;(s) for jy ide 14;;5;. Let G, of 
order u, over F be the linear subset of G; composed of all quantities s 
for which Zi, --- , L,-1 all take values of degree not greater than c. 
Designate by P, the set of all L,(s) with s in G,, and by Q, the set of 
all polynomials in P, of degree not exceeding c. Since the maximum 
degree possible for a polynomial in P, is c+d,, the number of linearly 
independent polynomials of P, which are not in Q,, that is, the order 
of P,/Q,, is less than or equal to d,. Now G,/G,ii=P,/Q,, a fact 
which follows from the mapping of G, on P, and G,4; on Q,. Hence 
[Gi:Gnsi] Therefore the order of is not less 
than n(c+1)—d. To be sure that G,; has elements other than zero, 
we must have u,,4;21, that is, nc=)c2>d+1—n. 

The following theorem applies if some of the L; must be made equal 
to zero. 


THEOREM 2. If in Theorem 1 the first m of the L; are to be made equal 
to zero and if their coefficients are in F|z|, then the conclusion will hold 


if 


For, the first m polynomials s; must be zero if we have the trans- 
formed system used in the proof of Theorem 1. Application of Theo- 
rem 1 for the remaining L; yields Theorem 2. 


UNIVERSITY OF CHICAGO 


g.c.d. of any finite number of elements exists and is linearly expressible in terms of 
those elements, that is, that every ideal with a finite basis is principal. 
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A NOTE ON A THEOREM OF RADO CONCERNING THE (1, m) 
CONFORMAL MAPS OF A MULTIPLY-CONNECTED 
REGION INTO ITSELF! 


MAURICE H. HEINS 


Let G, denote a region in the z-plane and let w=f(z) be a function 
of z defined for zCG, which has the following properties: (1) w=f(z) 
is analytic and single-valued for zCG., (2) zCG, implies that f(z)CG,, 
(3) to each point woCG, there correspond m and only m points 2 
(k=1, 2, - - -,m) contained in G, such that f(z) =wo (k =1,2, - - -,m) 
where following the usual convention we count the 2 according to 
their multiplicities. Then w=f(z) is said to define a (1, m) conformal 
map of G, onto itself. Such maps have been studied by Fatou? and Julia* 
for the case where G, is simply-connected, and by Radé‘ who treated 
multiply-connected regions as well. Among the results which Radé 
established is the following theorem: 


Let G, be a region of finite connectivity p (>1); then there exists no 
(1, m) conformal map of G, onto itself for m>1. 


Let us remark with Radé that the theorem is no longer valid if 
G, is of infinite connectivity, as simple examples from the theory of 
the iteration of rational functions show.’ Rad6’s proof of the theorem 
just cited is based on the possibility of mapping one-to-one and con- 
formally a region of finite connectivity p, none of the components of 
its boundary reducing to points, onto a region of connectivity p, the 
boundary of which consists of p disjoint circles. Other types of canoni- 
cal regions yield the same result, notably one due to Koebe.® It is the 
object of the present note to establish Radé’s theorem directly with- 
out appeal to the possibility of mapping one-to-one and conformally 
the’ region G, onto a canonical region. Our tools are the theory of 
iteration and a simple modification of Nevanlinna’s principle of har- 
monic measure.’ 

Let G,, the region we are going to study, have as its boundary p 
(>1) disjoint continua T, (k=1, 2,---, p). It is evident that we 


1 Presented to the Society, April 27, 1940, under the title A note on a theorem of 
Radé. 

2 P. Fatou, Bulletin de la Société Mathématique de France, 1919. 

3 G. Julia, Comptes Rendus de I’ Académie des Sciences, Paris, vol. 166 (1918). 

4 T. Rad6, Acta Szeged, vol. 1 (1922). 

5 G. Julia, Journal de Mathématiques Pures et Appliquées, 1918. 

6 P. Koebe, Acta Mathematica, vol. 41. 

7R. Nevanlinna, Eindeutige analytische Funktionen, chap. 3. 
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may assume without loss of generality that the IT, are all closed 
Jordan curves. Further let w=f(z) define a (1, m) conformal map 
of G, onto itself. In Radé’s paper cited above, it is shown that if 
w=f(z) defines a (1, m) conformal map of G, onto itself, then when- 
ever z tends to the boundary of G,, so does f(z), in such a manner 
that whenever z tends to a given component I; of the boundary, then 
f(z) tends to one and only one component of the boundary I), where 
the index /; depends on k. Now the relation k—-1, (k=1, 2,---, p) 
defines a permutation of the indices 1, 2,---, p. 

We shall understand by f,(z), the nth iterate of f(z), that function 
defined by the recursive relations 


(A) folz) = 2, filz) = f2), fal2) = 


By the definition of w=f(z) it is clear that the definition given by (A) 
for f,(z) is meaningful. Since the relation k—/, defines a permutation 
of the indices 1, 2, - - - , p, a suitably chosen power of this permuta- 
tion is the identity. Hence for a properly chosen whole number 1p, 
fn.(2) has the property that when z tends to a component I; of the 
boundary of G,, then f,,(z) tends to exactly the same component I’;. 
Let us denote f,,(z) by F(z). 

We are now in a position to demonstrate Rad6’s theorem directly. 
By the harmonic measure of z with respect to T;., denoted by w(z, Tx, G.) 
we understand that harmonic function defined for zCG, which is 
single-valued and bounded for zCG, and further takes on the bound- 
ary value 1 on T; and the boundary value 0 on all the components of 
the boundary of G, exclusive of T’;.8 

The relations 


(B) w(F(z), Tx, Gz) = w(z, G2), k=1,2,---,p, 


are an immediate consequence of the principle of the maximum for 
harmonic functions. For, as z tends to a given component of the bound- 
ary of G., F(z) tends to the same component. Hence w( F(z), Tx, Gz) 
has the same boundary values as w(z, I';, G.), hence the identity. To 
establish Radé’s theorem we need consider only one of these identi- 
ties, say 


(C) w(F(z), T1, G.) = w(z, Ti, G2). 


Let be a point lying in G,, and let w(2o, T1, G.) (O<Ao <1). Then 
our identity (C) implies 


w(F;,(20), G.) = Xo, 


8 R. Nevanlinna, ibid. 
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where F,,(z) is the mth iterate of F(z) for n=1, 2,---. Hence if z is 
on a given level curve defined by w(z, T1, G.) = (0<A<1), F(z) is 
on the same level curve for n= 1, 2, - - - . This permits us to conclude 
that no limit function of the sequence { F,(z)} is a constant. From 
this fact we infer that w= F(z) defines a (1, 1) conformal map of G, 
onto itself, and hence so does w=f(z); that is, m cannot exceed one. 

Suppose contrary to our assertion that m>1. Then w= F(z) would 
define a (1, m”*) conformal map of G, onto itself and w= F,(z) would 
define a (1, m””°) conformal map of G, onto itself. Let { Fx, (2) } be a 
convergent subsequence of { F(z) } and let Fo(z) denote the limit 
function of this subsequence. A point wo has, under the map defined 
by w=F,,,(z), m="° antecedents all of which lie on the level curve 
defined by w(z, T1, Gz.) =w(wo, Ti, G.). But by Hurwitz’s theorem, 
for k, sufficiently large F;:,(z)—wo and Fo(z)—wo have the same 
number of zeros, and this is manifestly impossible. Hence m cannot 
exceed one. 

It is interesting to note that the technique employed in the present 
proof of Radé’s theorem, that is, a combined use of the theory of 
iteration and of an extended form of the principle of harmonic meas- 
ure (the modification consisting in the fact that we did not require 
the continuity of F(z) on T',) may be applied to other problems in the 
theory of the conformal mapping of multiply-connected regions. In 
particular, such a technique permits us to conclude that the number 
of (1, 1) conformal maps of a given multiply-connected region of finite 
connectivity p, where p>2, bounded by # disjoint continua is finite. 
Koebe’s proof of this theorem is based on the fact that the region in 
question can be mapped one-to-one and conformally onto a canonical 
region. We shall not give in the present note the details of a direct 
proof of this theorem. Let us remark however that it is based on 
the fact that for p>2 each of the harmonic functions w(z, T;, G,.) 
(k=1, 2,---, p) has precisely p—2 (>0) critical points. A study of 
the level curves containing these critical points yields the desired re- 
sult. 
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NON-CYCLIC ALGEBRAS OF DEGREE FOUR AND 
EXPONENT TWO WITH PURE MAXIMAL 
SUBFIELDS! 


ROY DUBISCH 


In a recent paper? A. A. Albert proved the falsity of the converse 
of the well known proposition that a cyclic normal division algebra 
contains a quantity 7 whose minimum equation is x*=j in the base 
field of the algebra. The proof consists in giving an example of a non- 
cyclic normal division algebra containing a quantity j7 as described 
above. The algebra described in that example was of degree and ex- 
ponent four. It is the purpose of this paper to show that the exponent 
does not affect the property, and this we shall do by constructing 
similarly an algebra of degree four whose exponent is two. 

We shall actually prove the following theorem. 


THEOREM. Let £ and n be independent indeterminants over the field R 
of real numbers, K = R(&, n). Then there exist non-cyclic normal division 
algebras of degree four and exponent two over K such that t‘=y in K, 
t? not in K, for some quantity t in each algebra A. 


To make our proof we use the known? property stating that a nor- 
mal division algebra of degree four has exponent two if and only if 
it is expressible as a direct product of two algebras of degree two. 
Therefore we may take our desired algebra A to be 


A=BXC= (I, i, j, ij) X (1, xy), 
(1) ji=-tj, P=u, jf? u~#0,aO0in K, 
yx 0#0,b640in K. 


We seek first a quantity ¢ with minimum equation #=7y in K. Now 
if we take t=a;i+a2j+a3tj, where ai, a2, a3 are in K(x), and if we put 


(2) a=di+dex, a3=fitfex, ci, di, fiin K, 
we can easily compute that if 


2 2 
+ 
di + div — fiu — fruv 


1 Presented to the Society, April 13, 1940. 

2 A.A. Albert, Non-cyclic algebras with pure maximal subfields, this Bulletin, vol. 44 
(1938), pp. 576-579. 

3 A. A. Albert, Normal division algebras of degree four, Transactions of this Society, 
vol. 34 (1932), p. 369. 


(3) 
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and if 
(4) be => 2d,d2a 2fifoua 0, 


the algebra A defined by (1) contains a quantity / such that ##=7y in K, 
t? not in K. 

In proving A a non-cyclic division algebra we use the following 
method.‘ Let 2?= 62+-e?=A in K, 6 and ein K with L=K(z) a quad- 
ratic field over K. The algebras B}=BXL and Co=CXL over L are 
generalized quaternion algebras over their reference field L. Further- 
more, Aj=A XL is the direct product Ay = By X Co. Then it is known® 
that a necessary and sufficient condition that A» over L shall be a 
division algebra is that the quadratic form 


(5) + ans uads + bAs obds) 


in the variables Ai, Ae, - - - , Asin L shall not vanish for any Ay, - - - , As 
not all zero in L. 

If J=RIE£, 7] is the integral domain of all polynomials in £ and 7 
with real coefficients, it is obvious that we may, without loss of gen- 
erality, take the A; in (5) to be in J. Hence we may write 


(6) = a + Bz, s¢=1,---,6, 
where the a; and 6; are in J. Then Xj = (af +67A) + 2.8.2 so that if 


(7) Pr=ai+Bi, Qi = 206i, 
the equation Q=0 becomes 
uP, + — uaP; — vP, — bP; + vbPs 
+ + aQ2 — uaQs — 104 — 60; + = 0. 


But 1 and z are linearly independent® with respect to K so that (8) 
implies that 


(9) n) = uP, + aP2 — uaP; — vP, — — vbP, = 0. 


(8) 


Now if in (3) we let ¢ =c@.=d,=1, d.=f,;=f2=0, we obtain 


(10) a= — (u+ wu). 


* This is the device used by A. A. Albert in his paper A construction of non-cyclic 
normal division algebras, this Bulletin, vol. 38 (1932), pp. 452-453. 

5 A. A. Albert, On the Wedderburn norm condition for cyclic algebras, this Bulletin, 
vol. 37 (1931), p. 311, Theorem 3. 

§ It is obvious that we can have this condition hold by a suitable choice of 6 and «. 
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Note that these choices also satisfy (4). Substituting this value for a 
in (3) we have 


(11) o(é, n) => — P2) — uvP,+ u?P3; + uP; — — bPs + vbPs. 


With these choices for the parameters of our algebra A is easy to 
prove by the well known method of an argument’ on the degrees of 
the polynomial in (7) that $(£, 7) cannot be identically zero unless 
all of the coefficients are zero. From this it follows that Ao over L isa 
division algebra over L; so is algebra A a division algebra over K. 
By its form it is a normal division algebra of degree four and exponent 
two over K. 

The proof of the non-cyclic character of A is exactly the same as 
that given by A. A. Albert in a previous paper,* and with this we have 
the desired theorem. 


WILson Junior COLLEGE 


7 A similar argument was employed by A. A. Albert in the paper last cited. 
§ This Bulletin, vol. 38 (1932), p. 454. 


NOTE ON NORMALITY IN QUASI-GROUPS 
D. C. MURDOCH 


An interesting problem in the theory of quasi-groups is to deter- 
mine how strong an associative law must be assumed in order to ob- 
tain a theory of normal subquasi-groups similar to that of ordinary 
groups. The properties of normal subgroups which it is desirable to 
retain in the non-associative case are, first, that they form a Dede- 
kind structure, and second, that each one gives rise to a quotient 
group homomorphic to the whole group. In this note we shall take the 
latter property as the definition of normality and show that the 
former follows from it under very general conditions. 

We shall understand by a quasi-group G a system of elements 
satisfying the following two postulates: 


I. PRoDUCT AXIOM. Any ordered pair of elements a, b of G has a 
unique product ab which also belongs to G. 


II. QUOTIENT AXIOM. For any two elements a and b of G there exist 
unique elements x and y of G such that 


ax = ya = b. 


We shall make one further assumption, namely that there exist 
left coset expansions with respect to any subquasi-group H of G. 
In other words, if H is any subquasi-group, the cosets aH and bH 
are either identical or have no elements in common. This assumption 
has been shown by Hausmann and Ore [1] to be equivalent to the 
following: 


III. WEAK ASSOCIATIVE LAW. If a and b are arbitrary elements of 
G and co a fixed element, let do be determined so that (ab)co=adyo. Then 
for any c 


(ab)c = ad, 
where d belongs to the quasi-group { co, do, c} generated by Co, do and c. 


Since no confusion can arise, we shall for convenience use the term 
subgroup for subquasi-group and quotient group for quotient quasi- 
group, without implying thereby that the systems in question are 
associative. 

It should be noted that a theory of normal subgroups has been 
given by Hausmann and Ore [1] based on the definition of a normal 
subgroup as a subgroup H such that aH=Ha for all a in G. This 
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theory requires that G should satisfy associative laws considerably 
stronger than III in order that the normal subgroups so defined shall 
have the desired properties. To overcome this difficulty we shall adopt 
instead the following: 


DEFINITION: A subgroup H of G shall be called normal if 
(aH)(bH) = (ab)H, 
for all elements a and b of G. 


This is the same definition used in a previous paper by the au- 
thor [2] where it was applied, however, only to quasi-groups satisfy- 
ng rather special associative laws. It will be shown here that the 
results of that paper concerning normal subgroups can be extended 
to any quasi-group satisfying ITI. 

Comparing this definition with that of Hausmann and Ore, it is 
seen that neither one implies the other. Under the latter all subgroups 
of a commutative quasi-group are normal, while under the former 
this property is enjoyed by the “abelian” quasi-groups [2] which 
satisfy the law (ab) (cd) = (ac) (bd). In a Suschkewitsch quasi-group [3] 
(having unique right unit e and satisfying a(bc) =(ab)(ec), the right 
unit ¢ is a subgroup which is normal in the sense of this paper but not 
in the sense of Hausmann and Ore. 

Examples of quasi-groups satisfying III but not satisfying the 
stronger laws previously discussed can easily be constructed by re- 
arranging the elements in the Cayley square of a group, or known 
quasi-group, so as to destroy the associativity without disturbing the 
existence of cosets. The following, for example, 


8 9. 


satisfies III since cosets exist with respect to all subgroups. It is easily 
verified that it satisfies none of the laws previously discussed by the 
author nor the laws As, A3, A, or As of Hausmann and Ore. 
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It is natural to require next that every element of a coset aH shall 
define the same coset aH, or in other words that (ah)H =aH for all h 
in H. In this connection we have the following: 


THEOREM 1. The necessary and sufficient condition that every coset aH 
may be defined by any one of its elements is that H contain the right unit 
of every element of G. 


ProorF. If H contains all right units of G, then aH =(ah)H for all h 
in H, since each coset contains the element ah. Conversely if 
aH =(ah)H for all hin H and alla in G, then ah must belong to (ah)H. 
Hence H contains the right unit of ah for all a in G and therefore 
contains all right units of G. 

The main object of this note is to prove that in any finite quasi- 
group the normal subgroups which contain all right units of G form 
a Dedekind structure. With a slight strengthening of the definition 
of normality the result may also be proved in the infinite case. We 
shall assume for the present, however, that G is finite. 


THEOREM 2. If H and K are normal subgroups of G and contain all 
right units of G, their crosscut H(\K =D 1s also normal. 


Proor. From the normality of H and K we have 
(aD)(bD) (ab)D, 


and since D is finite the equality must hold. Moreover D is not void 
since it contains all right units of G. 


THEOREM 3. If H and K are normal and contain all right units of G, 
their union HUK consists of all products hk where h belongs to H and 
ktoK. 

The proof of this thecrem is similar to that of Theorem 7 of the 
author’s previous paper already cited. The union HUK may there- 
fore be written as the product HK. 


THEOREM 4. If H and K are normal, and contain all right units of G, 
their union is also normal. 


Proor. Using the notation e, for the right unit of a, we have, from 
the normality of Hand K, 


[a(HK) = | 
[(aH)K][(bH)K] = [(aH)(bH)|K = [(ab)H]K 
[(ab)e.s|[HK] = (ab)(HK), 


which completes the proof. 
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THEOREM 5. The normal subgroups of G which contain all right units 
of G form a Dedekind structure. 


PrRooF. From Theorems 2 and 4 these subgroups form a structure. 
It is only necessary, therefore, to show that if MDH, 


MO(HUK)CHU(MQO &). 


This follows as usual since every element of the left-hand side has the 
form hk and belongs to M. Hence k belongs to M(/K since h belongs 
to M. 

In establishing Theorem 5 we have used the finiteness of G in the 
proofs of Theorems 2, 3 and 4. This can be avoided by strengthening 
the definition of normality as follows: 

A subgroup H shall be called normal in G if for all elements a, b 
of G and every element h of H we have 


(ah)(bH) = (ab)H, (aH)(bh) = (ab)H. 


This of course implies the other definition and is equivalent to it in 
case G is finite. This strengthening of the definition enables us to 
prove the quotient axioms for the union and crosscut in Theorems 2 
and 3. For example, for the union, the equation x(/,k,) =heke has a 
unique solution of the form x=hk. For choosing h so that hh, =hn, it 
follows from (2) that there exists an element k of K such that 


(hk) (hy) = heko. 
For the proof of Theorem 4 we have as in the finite case 
a(HK) = (ae,)(HK) C (aH)K. 


But in the infinite case it is necessary to show also that (aH)K 
Ca(HK). This follows from III, applied in the form of Theorem 7, 
Chapter I, Hausmann and Ore. 

From Theorem 5 the usual theorems concerning normal subgroups 
follow. In particular, the Jordan-Hélder theorem for principal chains, 
and the Schmidt-Remak decomposition theorems, as proved by Ore 
[4] for arbitrary Dedekind structures, will all hold. One interesting 
question arises which has no counterpart in ordinary groups, namely 
whether the subgroup R, generated by the right units of G, is neces- 
sarily normal. Although there seems to be no obvious reason to be- 
lieve that this is true, I have no counterexample. If R is not always 
normal the question arises whether infinite quasi-groups exist in which 
the Dedekind structure of normal subgroups has no unit element. 
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YALE UNIVERSITY 


ON THE FIRST CASE OF FERMAT’S LAST THEOREM 
D. H. AND EMMA LEHMER 


In 1909 Wieferich [1] proved his celebrated criterion for the first 
case of Fermat’s last theorem, namely: 
The equation 


(1) xP + y? = 27, x, ¥, 2 prime to p, 


has no solutions unless 


(2) 2”-1 = 1 (mod p?). 
Since that time numerous other criteria of the form 
(3) m?-! = 1 (mod 


have been proved by Mirimanoff [2] (for m=3), Vandiver [3] (for 
m=5), Frobenius [4], Pollaczek [5], Morishima [6], and Rosser [7] 
for all prime values of m<41. 

Wieferich’s criterion alone has been applied by Meissner [8] and 
Beeger [9] for p<16,000 and was found to be satisfied only for 
p= 1,093 and 3,511, both of which cases failed to satisfy Mirimanoff’s 
criterion. 

Until recently no effort has been made to combine these various 
criteria in a practical way. Mirimanoff observed, however, in 1910 
that his criterion and that of Wieferich could be combined to state 
that equation (1) has no solutions for all primes of the form 2738+ 1 
or |2*+ 34]. 

In the presence of more criteria this statement can be extended thus: 

We call a number an “A, number” (after Western) if it is divisible 
by no prime exceeding the mth prime pnp. If the criterion (3) has been 
established for all m<p,, then equation (1) does not hold if p is the 
sum or difference of two A, numbers [10]. Since all the numbers less 
than ~,41 are A, numbers, we may state that equation (1) has no 
solution for any prime in a region where the A, numbers are so dense 
that they do not differ by more than 2p,,;:—1. This method was used 
in 1938 by A. E. Western [11] to show that (1) is impossible for 
16,000 < p< 100,000. 

A more powerful method of combining the criteria was suggested 
recently by Rosser [12], who observes that while the congruence 


(4) x?-! = 1 (mod p”) 
has only (p—1)/2 solutions less than p?2/2, every A, number is a 
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solution of (4) if (1) holds and if (3) has been established as far as 
m= p,. Hence if @,(x) denotes the number of A, numbers not exceed- 
ing x, then 

(5) on(p?/2) S (p — 1)/2, 

a condition which cannot hold for small p’s if the A, numbers are 
sufficiently dense. 

Other inequalities of this kind can be derived by separating the 
solutions of (4) into classes. We shall do this here only in the case in 
which the solutions are distinguished by their parity. For every posi- 
tive odd solution w< p*/3 of (4) there exists also a solution (p?—w)/2 
(using Wieferich’s criterion) which lies between p?/3 and p?/2, and 
hence differs from w. The number (p—1)/2 of solutions of (4) not 
exceeding p?/2 is then at least the number of even A, numbers less 
than p?/3 plus twice the number of odd A, numbers less than p?/3. 
This can be written as 


(6) n(p?/3) + on(p?/3) S (p — 1)/2 
where ¢x(x) denotes the number of odd A, numbers less than x. 

If one is to apply an inequality such as (5) or (6) outside the limit 
of existing tables [13] of A, numbers, it is necessary to find lower 
bounds for ¢, and @¢*. Rosser [12] has given a lower bound for 
¢,(10*) in the form of a polynomial f,(x) of the mth degree. 

By an improved method [14] which makes use of Bernoulli poly- 
nomials we have constructed polynomials P,,(x) and Q,(x) of degree 
n giving lower and upper bounds for ¢,(107), and also a polynomial 
P_,(x) of degree n—1 giving a lower bound for ¢,*(107). As we shall 
actually need not Px_; but the sum P,+P%_1, we tabulate the fol- 
lowing polynomials for »=13: 


* 
P;3(x) P;3(x) +Pi2(x) Qis(x) 

-0°1380608198 x! -0°91380608198 -0°1380608198 
-071272704088 -071326732670 .071326732670 x" 
.0°5247670000 x -0°5691949617 -0°5729012056 x 
.0*1277269076 -041439087748 .0*1469359894 
-0°2039391192 x? .0°2383232393 .0°2493297644 x°® 
-072244543403 .0?2716892889 .0?2952478417 x8 
-01740607057 <x’ -02179700460 <x’ -02510548659 x? 
-09545572654 x5 .1235281024 .1557243217 x8 
.3654889719 x5 -4882665632 x5 .7120647453 
-9452666752 1 .302427247 x‘ 2 .432810990 x! 
1.542870198 2.191273599 x? 6.376807029 x? 
1.361483164 x 1.996360212 x? 13 .24421612 x? 
.3459458265 x -5410860433 <x 21.15029718 x 

— .2049399029 — .2911608671 19 .69764203 


1941] FERMAT’S LAST THEOREM 141 


If we now replace (5) by 
P,;(log p?/2) = (p 1)/2, 


we find, by actual substitution into P.3, that this inequality holds 
only for p=93,785,629, and hence (1) has no solution! for p<93,- 
785,629. 

Since 


Qia(log p?/2) S (p — 1)/2 


holds only for p> 141,000,000, it follows that the inequality (5), even 
if we knew the exact value of ¢,(x), could not be used beyond this 
limit for n=13. 

The inequality (6) becomes 


Pis(log 2/3) + Pia(log p?/3) (p — 1)/2. 


This holds only for p>102,108,200. 

Hence the first case of Fermat’s last theorem is now proved for 
p< 102,108,200. 

Further criteria of type (3), when established, may be used to ex- 
tend this limit by calculating approximating polynomials of higher 
degree from the ones given above by the method described in [14]. 

Note added March 1. Since writing the above, Dr. Rosser has kindly 
sent us the manuscript of his forthcoming paper [15] in which he 
completes Morishima’s proof that the prime 43 gives also a criterion 
of the form (3). We have therefore calculated P,4(x) and P}3(x) 
from the Pj; and Pi, given above. These polynomials are as follows: 


* 
Pi,(x) P,3(x) Py4(x)+Pis(x) 
-0"6037145739 -0%6037145739 
-0°6683705079 x -0°2544306733 x8 -0°6938135752 
-073337944336 x .082536688143 x” -073591613150 x” 
-0°9940243443 -0°1131551216 x"! -0°1107179466 x"! 
-041965570473 x19 -0°2980593461 x19 -042263629819 
.032719317505 -045152041706 -033234521676 
.022700968174 -076140455952 .0?3315013769 x8 
-01944029859 x7 -095157919367 x7 .02459821796 x? 
-1009940169 = .03064345185 x -1316374687 x8 
.3720257684 .1271048263 = -4991305947 x5 
-9365601794 x! -3559975936 1.292557773 x4 
1.500217863 -6290315435 x3 2.129249406 x 
1.299882088 2 6014604442 x 1.901342532 x? 
.3050249350 x -1730336327 =x -4780585677 x 
— .2081092173 — .08700841024 — .2951176274 


1 The corresponding result obtained by Rosser is about 41,000,000. 
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We find that (5) is satisfied only if p>230 millions, while (6) gives 
the better inequality p= 253,747,889. 
The first case of Fermat’s last theorem is therefore now established 
for all p <253,747,889. 
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LEHIGH UNIVERSITY 


PROOF OF A THEOREM OF HALL 
HARRY GOHEEN 


In the Journal of the London Mathematical Society for July, 1937, 
Mr. Philip Hall gave a proof of the theorem, “If a group G of order g 
has a subgroup of order m for every divisor m of g such that 
(m, g/m) =1, then G is a soluble group.” The proof is a very simple 
one in contrast to the rather difficult proof of the converse theorem 
which Hall had published in the same journal for April, 1927. It seems 
worthwhile to give a simpler proof of this converse. 


THEOREM. [f g is the order of a soluble group G and m is a divisor of g 
such that m and g/m are relatively prime, then G has a subgroup of order 
m and furthermore all the subgroups of order m in G are conjugate un- 
der G. 


ProoF. Since the theorem is true by default for prime power groups, 
let us suppose that it is true for all soluble groups of orders less than g 
and use the method of complete induction. Then the theorem is true 
for an invariant subgroup G’ of prime index r in G. 

If r divides g/m, then G’ has a subgroup of order m. Since G’ is 
invariant and (m, r) =1, every element of order dividing m in G must 
be in G’. Hence all the subgroups of order m in G must be in G’ where 
by hypothesis they form a complete set of conjugates under G’ and 
consequently a complete set of conjugates under G. 

If r divides m, then G’ has a complete set of conjugates of order 
m/r. Since G’ is invariant, the subgroups of order m/r in G’ are a com- 
plete set of conjugates under G. Let M’ be one of these subgroups of 
order m/r in G’. 

If M’ is the only subgroup of order m/r in G’, then, it must be in- 
variant in G. Then the quotient group is of order rg/m and since r is a 
Sylow divisor of the order of this quotient group, it has a subgroup of 
order r. Then G has a subgroup of order r-m/r or m. On the other 
hand if M’ is not the only subgroup of order m/r in G’, let it be one 
of k subgroups of order m/r in G’. Then k divides the order g/r of G’ 
and the normalizer of M’ in G is of order g/k divisible by m. Since 
this normalizer is a soluble group of order g/k less than g, it has a 
subgroup of order m. 

There remains only to show, for the case r divides m, that all the 
subgroups of order m in G are conjugate under G. Let M be a sub- 
group of order m. If there is no other subgroup of order m in G, then 
the theorem is true by default. However, if M, is another subgroup 
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of order m in G it will be shown that Mand M, are conjugate under G. 

Let the crosscut of M and G’ be [ of order y and the crosscut of M, 
and G’ be I; of order y:. Then since G is generated by M and G’ as 
well as by M, and G’ and since G’ is invariant under both M and M, 
we have 


(g/r)(m)/y = g, = 8, 
whence 
= m/r. 


Then since [ and I, are of order m/r, they are conjugate under G’. 
If STS =T,then S-'M,Sand M have a common invariant subgroup 
I. If T is invariant in G, the quotient group G/T is of order gr/m. 
Since r is a Sylow divisor, the subgroups M/T and S-!M,S/T of order 
r are conjugate under G/T and hence M and S-!M,S are conjugate 
under G as was to be proved. If, however, IT is not invariant under G, 
its normalizer is a proper subgroup of G containing M and S-'M,S 
which are therefore conjugate under the normalizer of I as was to 
be proved. 


REED COLLEGE 


ON REGULAR FAMILIES OF CURVES! 
HASSLER WHITNEY 


A family F of non-intersecting curves filling a metric space is called 
regular if, in a neighborhood of any point p, it is homeomorphic with a 
family of straight lines. We have given in another paper? a necessary 
and sufficient condition, which we shall call (A’) (to be described be- 
low), that a family F be regular. We shall prove in this note that the 
following condition is sufficient: 

(A) Given any point p, and a direction on the curve through , 
there is an arc pq in this direction with the following property. For 
every €>0 there is a 5>0 such that for any p’, with p(p’, P) < <4, there 
is an arc p’q’ of C(p’) such that 


(1) CVApg), CVQ). 


The condition (A’) is the same, except that after (1), we add: 

(2) If r’ and s’ are on p’q’ and p(r’, s’) <6, then 6(r’s’) <e. 

From the present theorem it is clear that the families of curves 
recently defined by Niemitzki? are regular. 

To prove the theorem, suppose (A) holds, but (A’) does not. Then 
the following is true: 

(B) There is a point , and a direction of the curve C(p), such that 
for any arc pg on C(p) in this direction, there is an e>0, such that 
for any 5>0, there is a point p’, with p(p’, p) <6, such that for any q’ 
on C(p’), 

(3) either p’g’¢ V.(pq), or g’F V.(Q), 


1 Presented to the Society, April 27, 1940. 

2 Annals of Mathematics, (2), vol. 34 (1933), pp. 244-270. We refer to this paper 
as RF. By RF, Theorem 7A, F is regular as there defined. The converse is proved as 
follows. By Theorem 174A, there is a cross-section S through p. In a neighborhood of p, 
the curves are orientable (this is easily seen, for instance, with the help of Theorem 
9B). Choose an open subset S’ of S, and let U be all points g’ =g’(g, «), gin S’, | a| <e 
(see RF, §15); U is a neighborhood of p, expressed as the product of S’ and the open 
line segment —e<a<e. 

By a curve, we shall mean here the topological image of an open line segment or 
of a circle. We shall use p(, g) for distance, 5(A) for the diameter of the set A, and 
V.(A) for the set of all points p, p(p, A) <e. Let C(p) mean the curve of F through p. 

3 V. Niemytzki, Recueil Mathématique de Moscou, vol. 6 (48) (1939), pp. 283-292. 
We mention two further papers in the subject: H. Whitney, Duke Mathematical 
Journal, vol. 4 (1938), pp. 222-226, showing that if the curves fill a region in 3-space, 
a cross-section may be chosen so as to be a 2-cell; W. Kaplan, Duke Mathematical 
Journal, vol. 7 (1940), pp. 154-185, studying families filling the plane. 
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(4) or there are points r’, s’ on p’q’ such that p(r’, s’) <6, and 
5(r’s’) De. 

Choose a point p and a direction on C= C(p), by (B). Choose g on C 
in this direction, by (A). Choose e>0 by (B). For each positive in- 
teger i, choose 6; by (A), with e replaced by €/i. Choose p; by (B), 
with 6 replaced by 6;. Choose g; by (A), with p’ replaced by ;; then 


(5) pigi C C 
By (B), as ¢;<€, we may choose p/ and q/ on p.gi so that 
(6) p(pi,gi) <6,  S8pigi) Ze. 


By (6), we may choose r; on pg! so that p(p/, r;) 2€/2. By (5) 
and (6), we may choose a subsequence so that for some points p’ and r 
on pq, 


then rp’. Say, for definiteness, that r is in the direction of g from p’. 
The set of such points r which are limits of such sequences {rx,} forms 
a closed set, which, by (5), is in p’q; we shall let r be the point furthest 
from p’. (It might be q.) 

Assuming that (A) holds for the point r and the direction away 
from p’, we shall arrive at a contradiction. Choose a point s on C in 
this direction from r, by (A). (If C is a closed curve, it might happen 
that s is on the arc pr.) Choose r’ and s’ on C just behind and just in 
front of r, so that r’ is on neither pp’ nor rs, and s’ is not on p’r. We 
shall show that for any e’>0 there is an integer j and a point s; on 
piq} within ¢’ of s’; as s is in pg, by (5), this will contradict the 
definition of r, and thus prove the theorem. 

Set 


(8) 4n = min [p(r’, rs), 2¢’]. 


Choose r_, r;, S_, S$, on C in the order r_r’r,rs_s’s,s, so that r_ is not 
in rs and s, is not in pr (if C is closed), and so that 


(9) S58, C VAS), 
Set 
(10) = min [o(pr_, r+); p(rs_, 545), 


Using r, s, and e’’, choose 6’’>0 by (A). By (7), we may choose j so 
that 


(11) e/j <€’, p(p/, p') < < < 8". 


E 
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By the choice of 5’’, we may choose s* on C(r;) so that 
(12) r;s* C V.--(rs), s* C V-_-(s). 


As 1;s* is a connected set, (11), (12) and (10) show that there is a point 
s; on it such that p(s;, rs_t+s,s) 2e’’; hence, by (12), p(s;, s_s,) <€’’, 
and by (9), 


(13) 8’) Ke’ nse. 
By (12) and (8), 

(14) p(1’, 7383) > 2n. 

By (5) and (11), 

(15) C Ve(pq). 


By (11), (10), (5) and (9), there are points p7 in pjr; and g# in 7r;qj 
such that 


(16) 7’) <2n, 7’) < 2n. 


By this and (14), the arc r;s; is contained in the arc p?rjq*Cpj qj. 
Hence s;C pj q}, which, with (13), gives the contradiction. 
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TRANSLATED FUNCTIONS AND STATISTICAL 
INDEPENDENCE! 


R. P. AGNEW AND M. KAC 


1. Introduction. Professor Steinhaus? proposed the question wheth- 
there exists a function f(#) defined in (— ©, ©) such that, for each 
sequence Ai, Ae, - - - of different real numbers, the “translated” func- 
tions f(¢+A:), f(¢+A2), -- - form a sequence of statistically independ- 
ent functions.* We shall answer this question in the affirmative by 
giving concrete examples, and shall discuss some related problems. 


2. Notation and lemmas. Let 
1 T 
(1) f(t)} = lim —] 
2T J 


denote as usual the mean value of f(t) in case the limit exists. If Eis a 
point set on the line (— ©, ~) and g(¢) is the characteristic function 
of E, the mean value AT { g(t) } (if it exists) is called the relative measure 
of E and will be denoted by | E| . A non-decreasing function o(a@) such 
that o(— ©) =0, o(+ ©) =1 is called an asymptotic distribution func- 
tion of f(t) if 


(2) | <a} | = ofa) 

at each point of continuity of o(a). A set fi(t), - - - , fn(é) of functions 
having asymptotic distribution functions o;(a@), - - - , n(a) is called 
statistically independent if 

(3) | ELAM ++ 5 fal) < an} | = 

for each set a1, - - - , @, of real numbers such that a; is a point of con- 


tinuity of o,(@). An infinite set of functions is called statistically inde- 
pendent if each finite subset is statistically independent. 

Our proofs will be based on the following theorem of Kac and Stein- 
haus, loc. cit. 


1 Presented to the Society, February 24, 1940. 

2 Ina letter to one of the authors. The problem arose in connection with the theory 
of turbulence, but because of the outbreak of the war we have not been able to get 
details. 

3 See M. Kac and H. Steinhaus, Sur les fonctions ind épendantes IV, Studia Mathe- 
matica, vol. 7 (1937), pp. 1-15; and P. Hartman, E. R. van Kampen, and A. Wintner, 
Asymptotic distributions and statistical independence, American Journal of Mathe- 
matics, vol. 61 (1939), pp. 477-486. 
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LemMA 1.4 set fi(t), - - - ,fn(t) of bounded functions having absolutely 
continuous distribution functions is statistically independent if and only 
af the equality 


= kj kj 
j=1 j=1 
holds for each set ki, ke, -- +, kn of positive integer exponents. 


Repeated use will be made of the following simple lemma. 


Lemna 2. If $(t) is a real differentiable function and constants tr >0 
and a>O exist such that 6'(t) is monotone and |p’(t)| >a over each of 
the intervals (— ©, and (rT, ~), then 


(5) = 0, 


Let lh, I2, and J; denote the integrals of exp i¢(t) over (—T, —7), 
(—7, 7), and (rt, T) respectively. Applying the second mean value 
theorem we obtain, when J >r, 


T ¢'(t) cos g(t) ¢’(t) sin 
I; = J +i 


‘(t) cos dt 


¢'(T) 
and evaluating the last four integrals gives | Z| <8/a. Likewise 
| I,| <8/a, and since | J;| <27 our lemma follows. 

If (t) isa real polynomial in ¢ of positive degree, Lemma 2 is obvi- 
ously applicable. The same is true if ¢(t) =>_”_,a, exp (¢+A,)? where 
An are different real numbers and - - - , @, are real coeffi- 
cients not all 0. 


3. An example. We can now establish our main result. 


THEOREM 1. The translations of the function 


(6) f(t) = cos e* 
form a statistically independent set; that is, for each set \1,---, Xn of 
different real numbers, the functions f(t+:),---,f(t+An) are statis- 


tically independent. 
Our first step is to verify that (4) holds. We find that 
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TI cost = J] 2-* [exp + exp (— 


j=1 j=1 
n kj 
j=1 a;=0 


- [exp (— i(k; — 


n kj 
II 2-* >> Ci;.a; exp i(2a; — 


j=1 a;=0 


kn 


a,=0 a,=0 


- exp i), (2a; — 


j=l 


Since 2{f} is an additive homogeneous operation, we can apply 
Lemma 2 to show that 


(7) ai} J] cos*i = 0 


j=l 


if at least one of ki, ke, -- - , Rn is odd; but 


(8) an} II cos*i = II 5/2 


j=1 j=1 
if ki, ke, ---, Rk, are all even. Noticing that the particular case n=1 
gives 
(0) cos* = 0, k odd, 
= k even, 


we see that the mean value of the product is the product of the 
mean values as in (4). Moreover the fact that the moments of 
f(t) =cos exp (t+)? are the same as the moments of cos ¢ implies‘ 
that the distribution function of f(t) is the same as that of cos ¢ and 
hence is absolutely continuous. Therefore we can apply Lemma 1 to 
obtain the conclusion of Theorem 1. 


4. Functions which oscillate less rapidly. The function cos exp 
of Theorem 1 oscillates very rapidly as |t|—+. A natural question 


4 A. Wintner, Uber die statistiche Unabhangigkeit der asymptotischen Verteilungsfunk- 
tionen inkommensurabler Partialschwingungen, Mathematische Zeitschrift, vol. 36 
(1933), pp. 618-629. 


n n 
n 
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arises whether exp /? can be replaced by a positive integer power of t, 
say t™. If we put f(¢) =cos ¢", then the answer to the question of statis- 
tical independence of the translations f(t+);), f(t+d:2),--- is only 
partially positive. 


THEOREM 2. If f(t) =cos t”, m being a positive integer, then each set 
of m different translations f(t+:), - - - , f(t+Xm) is a statistically inde- 
pendent set; but (m+-1) different numbers dy, - - - , \ms1 can be chosen in 
such a way that the corresponding set of (m-+-1) translations is not sta- 
tistically independent. 


Proof of this theorem is similar to that of Theorem 1. For each 
n=1,2,--- we find on setting 


(10) P,(é) = Il cos*i (t + ;)™ 


j=1 
that 
ki kn 


a,=0 


where $(t) =@(Ai, , An; Ri, Rn3 Qn; is given by 


j=l s=0 j=1 


Applying Lemma 2, we see that 2 {exp i¢(t) } +0 if and only if 


(13) (2a; = 0, =0,1,---,m—1. 
j=1 

In case n=™m, (13) is a system of m homogeneous linear equations 

in m quantities (2a;—k;) whose determinant (Vandermonde’s de- 

terminant) does not vanish; and if (13) holds, then 2a;=k; when 


j=1i, 2,---, m. Hence, in case n=m, mM {exp ip(t) } =0 unless 
ki, ---, Rm are even and a;=k;/2 when j=1, 2,---, m. Therefore, 
in case n»=m, formulas analogous to (7) and (8) hold and statistical 
independence of f(t+A,), - - - , f(t+Xm) follows as in the proof of The- 
orem 1. 

In case m>1 and Ay, Ae, --- , An are linearly independent (in the 
sense that no one of \y, - - - , A, is a linear combination with rational 
coefficients of the remaining) then (13) again implies that 2a;=k; and 
we conclude that f(t+A:), - - - , f(¢+A,) are statistically independent. 


In case m >1, this shows existence of infinite sets of statistically inde- 
pendent translations of cos ¢™. 
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We complete the proof of Theorem 2 by proving the following theo- 
rem. 


THEOREM 3. If m is a positive integer and dy, 2, - are m+1 
different rational numbers, then the set of (m+-1) translations 


(14) cos (¢ + cos + - ++ , Cos (t + Amy1)™ 
is not statistically independent. 


We use the formulas obtained by setting »=m-+1 in (10), (11), 
(12), and (13). The system 


m+1 

(15) >d aj;x; = 0, s=0,1,---,m—1, 
j=1 

of m equations in (m+1) unknowns %, - - - , Xm41 is such that if a 


nonzero rational value is assigned to one of the x’s, then the re- 
maining x’s are uniquely determined and are rational. It follows that 
(15) has non-trivial solutions in integers. Let X denote the set of 


points x=(x1,---, Xm41) in (m+1)-dimensional space for which 
X1,° °°, Xm41 are integers, not all 0, satisfying (15). For each xEX, 
let ||x||?=27+ - - - +22,,,. Since ||x||? is necessarily a positive integer, 
the minimum value of ||x||? for x@X is an integer R>O. Choose 
such that llyl]2=R, and let ki, ---, Rms be 
defined by 
(16) ki =| j=1,2,---,m+1. 
If 0 <a; <k; for each =1, 2, - - - ,m+1 while 0 <a;<k; for at least 

one j, then 

m+1 2 m+1 2 m+1 2 
(17) (2e;- ki) < Dk = Lyi =R 

j=1 j=1 j=1 


and the minimal property of R implies that the point 
(18) — hi, — he, - , — 


is not in the set X ; this means that at least one of the equations in (13) 
fails to hold, and accordingly M {exp ip(t) } =0. The remaining 2”*" 
terms in the sum (11) are those for which each aq; is either 0 or k; 
and accordingly (18) is a point on the sphere (hypersphere) S with 
center at (0, -- - , 0) and radius R"/?, But the set X, being on each of 
m independent planes (hyperplanes), is on their line of intersection; 
hence at most two of the points on S can be in the set X. It follows 
that the point (18) fails to be in X, and hence that At {exp ip(t)} =(), 
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unless the indices a1, - - - , @n41 are such that (i) 2a;—k;=~y; for each 
j=1,---, m+1 or (ii) 2a;—kj;= —y; for each j=1,---, m+1. In 
case (i), (13) holds so that $(¢) is independent of t, and we see that 
o(t) =0 where 


m+1 


(19) > 


j=l 


in case (ii), = Therefore 


m+1 
(20) [I cos i (t+ X,;) \ 


j=1 


- 2-km+1(2 cos 0) 0, 


the last step being a consequence of the fact that the right member of 
(19) is rational and hence that @ cannot be an odd multiple of 7/2. 


At least one of ki, ke,--+, ny: must be odd, for otherwise 
2=(y:/2,---, ¥m41/2) would be a point of X with ||z||?=||y|]2/4 
= R/4<R; therefore 

m+1 
(21) [I {cos* (¢ +2) =0. 


j=1 


Comparing (20) and (21), we see that (4) fails when n=m-+1 and 
f(t) =cos (t+A,;)"; hence Lemma 1 implies that the (m+1) transla- 
tions in (14) are not statistically independent. This completes proof 
of Theorems 3 and 2. 

Anyone who wishes to check Theorem 3 by consideration of a sim- 
ple special case will find that if m=2, \i=1, \2.=2, As=3 then 
y=(1, ye, ys) may be either (1, —2, 1) or (—1, 2, —1); that k,:=1, 
ke=2, ks=1; and that elementary formulas for products of cosines 
and use of Lemma 2 gives 


(22) {cos (t + 1)? cos? (¢ + 2)? cos (¢ + 3)2} = 8-! cos 2, 
which is a special case of (20). 


5. Other functions. Our methods can be used to show that certain 
rapidly oscillating functions f(t) other than the function cos exp ?? of 
§3 have statistically independent translations. Our first solution of 
the problem of §1 was a “direct” (but too tedious to be given here) 
proof of the fact that the set of all translations of the function 
cos exp exp ?/? is statistically independent. Our “direct” method uses 
the definition of statistical independence rather than the criterion of 
Lemma 1, and applies only to functions which oscillate very rapidly. 


= 
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To handle such functions as cos ¢”, where the question of statistical 
independence of translations is more delicate, we need the criterion 
of Lemma 1. 


6. Applications. As soon as statistical independence of a set of func- 
tions is established, we can apply the general theory of addition of 
independent random variables.’ As an illustration, we mention the 
following result: If f(t) =cos exp ¢?, and Au, Az, --- is a sequence of 
different real numbers, then 


(23) lim < | =— f 


qil2 


CORNELL UNIVERSITY 


5 See Kac and Steinhaus, loc. cit., and further references given there. Also P. Lévy, 
Théorie de I’ Addition des Variables Aléatoires, Paris, Gauthier-Villars, 1937. 


NOTE ON A CERTAIN TYPE OF 
DIOPHANTINE SYSTEM! 


E. T. BELL 


1. The quadratic case. Several writers? have considered the prob- 
lem of making a*x?+dx, b?x?+-ex,--- simultaneously squares, a, d, 
b, e,--- being constants, without discussing the conditions under 
which an integer solution exists. The fact that the coefficients of x? 
are squares is of no consequence. It is shown in §2 how necessary and 
sufficient conditions for the existence of integer solutions in much 
more general problems (§2, (9), (13)) can be determined and how, 
when these conditions are satisfied, the solutions may be found. The 
details are given first for three very special cases, (1), (5), (6) below. 

If all letters denote integers, and a, b, c, d are constant, we seek 
necessary and sufficient conditions that the system 


(1) ax? + bx = y?, cx? + dx = 2? 


shall have a solution x, y, z. We shall assume that abcd £0, as the ex- 
cluded possibilities require only slight modifications, all of which are 
included in the general method. 

The required conditions are that b, d be simultaneously represent- 
able in two forms of degree seven. Precisely, (1) has a solution in in- 
tegers x, y, z if and only if integers f, g, h, k, m, n exist such that 


(2) b = hf?(m? — ag*k?), d = hg?(n? — cf*k?). 
Provided such f, g,--- exist, the complete solution of (1) is 
x = hf*gk?, y = hf*gkm, = hfg*kn, 


where f, g, h, k, m, n run through all solutions of (2). 
To prove this, we rewrite (1) as 


ax(ax + b) = y’, x(cx + d) = 2’, 
which is of the form 
(3) xu = y’, xv = 27, 


with 
u=ax+b, 


The complete integer solutions of the respective equations in (3), 


1 Presented to the Society, April 6, 1940. 
2 Summary of results to 1920 in L. E. Dickson, History of the Theory of Numbers, 
vol. 2, 1920, chap. 18. 
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with x, y, 2, u, v unrestricted variables, are 
2 2 
x = 7351, u = ryt, y = 
2 2 
x = 1282, v = Pele, 2 = 
in which all the letters with suffixes denote arbitrary integers. Since 
in the system (3) the two values of x must be equal, we have 


2 2 
(4) 1151 = 1252, 


of which the complete integer solution is 
n=hf, n=hg, kf, 


where f, g, h, k are arbitrary integers. From the (restricted) values 
u=ax+b, v=cx+d of u, v, we find the stated conditions on 5}, d, after 
the change of notation (f, t2)—>(m, n). The completeness of the solu- 
tions of (3), (4) implies the completeness of the solution of (1) when 
f, g,--- run through all solutions of (2). This treatment is possible 
only because (3), equivalent to (1), is a pure multiplicative system. 
A full discussion of such systems, with a non-tentative method for 
finding their complete solutions, was given in a former paper.* 

The above solutions of (3), (4) were obtained by this method. 

Proceeding similarly with 


(5) ax? + bx = y’, cx? + dx = 2?, ex? + jx = w’, 


in which a, b, c, d, e, 7 are constant integers, different from zero (not 
an essential restriction), we find that all solutions x, y, z, w are given 


by 


x = y = hf*gkpq?rs?l, 
= hf*g*k*pqrsm, w = hfg*kpgq?r’sn, 
where h, - - - , m run through all integer solutions of 


b= hf?gs*(? ak?g?p?r?), d= hf?g?k?(m? cp*q?r’s?), 
= hg*q?*r?(n? ek? f?p?s?), 


Here the simultaneous representation of 6, d, 7 which is necessary 
and sufficient for the existence of a solution of (5) is by forms of degree 
fifteen. This may suggest that anything approaching a complete solu- 
tion of the systems quoted? is not to be expected in the near future. 
However, any number of solvable systems can be constructed immedi- 


* American Journal of Mathematics, vol. 55 (1933), pp. 50-66. 
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ately from the conditions, and likewise for such systems having infini- 
ties of solutions. 
The pure multiplicative system corresponding to (3) for (5) is 


of which the complete solution is 
hf*q*s*, re = hf*g?k?, = hg*q*r’, 
kgpr, Se = pars, $3 = kfps. 


The special features of (1), (5) and the systems cited? which make 
it possible to treat the equations as a pure multiplicative system are 
that each equation lacks a constant term in one of its members and 
that its other member consists of a single term. The second of these 
restrictions is removed in §2. It is immaterial that the single term has 
the coefficient unity; for multiplicative systems with arbitrary con- 
stant integer coefficients can also be solved completely by a straight- 
forward, non-tentative process.* 

For example, the pure multiplicative system corresponding to the 
system 


(6) ax? + bx = ry’, cx? + dx = s2?, 


$1 


in which the constant integers a, b, c,d, r, s are all different from zero, 
is 


(7) xu = ry?, xv = sz’. 


We first solve (7) as if all the letters were variable integers, and find 
the complete solution 


22 22 222 2 
2 
(8) Z = 
22 22 
= Lef351X1 121, = 
2 2 
Y = tryrorsa, S = 151505384, 


in which ¢ and all the letters with suffixes denote arbitrary integers. 
If now 7, s are constant, we resolve them in all possible ways into 
products of five factors as indicated above, and it suffices (as appears 
in the method cited*) to take ¢t equal to the G.C.D. of r, s. With each 
such pair of resolutions of r, s there is associated a solution x, y, 2, u, v 
as given in (8); all the solutions of (7), with 7, s constant, fall into sets 
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determined by these pairs. Proceeding as in (1)—(4), we find that (6) 
has an integer solution x, y, 2 if and only if integers x1, x2, V1, Ye, 21, 22 
exist such that, for some resolution of 7, s as in (8), 


22222 

b = — 
2 2 22 222 

d = — 


and if these conditions are satisfied, the complete solution x, y, z of 
(7) is as given in (8), in which ¢ and the 7;, s; refer to all resolutions 
of the constants r, s of the type indicated. The extension to a system 
of n (>2) quadratics is immediate. 


2. The general case. The notation is as follows. 

P (x) isa polynomial in x of degree m;+r;, m;>0, r;>0, with con- 
stant integer coefficients. 

The term of lowest degree in P;(x) is aix™, a; 0. 


Pi(x) = 2™Q(x), Q(x) = Rix) + a4, 


so that R;(x) isa polynomial in x of degree r; >0 with constant integer 
coefficients and no constant term. 

b:, n;, s denote constant integers, n;>0, s>1. 

By the method exemplified in §1, we may obtain necessary and 
sufficient conditions that the system 


shall have a solution in integers x, y:, -- - , ys. The conditions concern 
the representability of the constant integers a;,---, ds in certain 
forms; the complete solution of (9) is known when all these repre- 
sentations of a1, -- - , @, are known. 
Rewriting (9) as 
™m; ni 

(10) x [Ri(x) a;| = ,= 1, 


we associate with (10) the pure multiplicative system 


(11) x™iz; = by", t=1,---,5s. 
The complete integer solution x, 2;, yi,7=1,---, 5s, of (11) is found 
as in §1 in terms of ft, =f(m,---, ms, m,--*-, Ms), parameters; say 


the solution is 


(12) x= f(m,---,m), 2 = Vi= 


E 
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The f, gi, 4; are power products in , - - - , u, with integer coefficients 
which are power products formed from the factors in certain resolu- 
tions of the constants }y, - - - , b, into products; the forms of f, g:, h; 
and the coefficients described depend only on the given constant in- 
tegers m;, n;. Necessary and sufficient conditions that the system (9) 
have a solution in integers x, y;,7=1, - - - , s, are found as in §1, and 
refer to the representation of the constants a; in certain forms. Com- 
paring (10), (11), we find 


a; = gilts,--- , Ue) — Ri(f(ms,--- , u=1,---,5s, 


as the required conditions. When these are satisfied, the solutions are 
obtained from (12). 

An obvious modification gives necessary and sufficient conditions 
for the existence of an integer solution, and so on, of the more general 
system 


(13) P(x) = 4=1,---,s, 


in which c;, b; are constants, P;(x) is as before, and T ;(y;) is a poly- 
nomial in y; of degree n;+1;, n;>0, t;>0, in which the term of lowest 
degree is d;yj!, d:~0. The associated pure multiplicative system is 


mi 


the conditions concern the representability of the a;, d; in certain 
forms. As in all cases the conditions are both necessary and sufficient, 
the complete solution of the given system is equivalent to finding the 
total representation of the coefficients a;, d; in a determinable system 
of forms. 


CALIFORNIA INSTITUTE OF TECHNOLOGY 


= 


POWER SERIES THE ROOTS OF WHOSE PARTIAL 
SUMS LIE IN A SECTOR! 


LOUIS WEISNER 


If the roots of the partial sums of a power series f(z) =) a,2" lie 
in a sector with vertex at the origin and aperture a<2z, the power 
series cannot have a positive finite radius of convergence.” But if f(z) 
is an entire function, the roots of its partial sums may lie in such a 
sector. The question arises: what restrictions are imposed on f(z) by 
the requirement that a be sufficiently small, say a<? According to 
a theorem of Pélya the order of f(z) must be not greater than 1 if the 
radius of convergence of the power series is positive.* Without this 
assumption the investigation which follows shows that if a<z, f(z) is 
an entire function of order 0. This result was obtained by Pélya for 
the case in which a=0.4 


LemoMA. If the complex numbers 2%,---, 2n 2n%0) He ina 
sector with vertex at the origin and aperture a<7, then 
nm cos a/2 1 
| | n k=l 
When a=0 equality occurs if and only if = --- =Zn. When a>0 


equality occurs if and only if n is even and n/2 of the numbers are equal 
to re** (r >0;05@<2n) and the other n/2 numbers are equal to re‘t, 


Suppose first that the sector is —a/2 Sam zSa/2. Let the nm num- 
bers be 


ze = | 
Since 
k=1 k=1 k=l 


1 Presented to the Society, April 27, 1940. 

2 This follows from Jentzsch’s theorem: every point on the circle of convergence of 
a power series is a limit point of roots of its partial sums. See R. Jentzsch, Unter- 
suchungen zur Theorie der Folgen analytischer Funktionen, Acta Mathematica, vol. 41 
(1917), p. 219; E. C. Titchmarsh, Theory of Functions, 1932, p. 238. 

3G. Pélya, Ueber Annadherung durch Polynome deren sémtliche Wurzeln in einen 
Winkelraum fallen, Nachrichten der Gesellschaft der Wissenschaften zu Gottingen, 
1913, pp. 325-330. 

4G. Pélya, Ueber Annéherung durch Polynome mit lauter reellen Wurzeln, Rendi- 
conti del Circolo Matematico di Palermo, vol. 36 (1913), pp. 279-295. 
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n 


D | 2x | cos = cos «/2 >>| 


k=1 k=1 


IV 


IV 


Zp 


(4) | 
nN 


Consequently 


IV 


cos a/2| 21-- 


| 


Since the numbers z;', - - - , z,' also lie in the sector —a/2<am z 
Sa/2, we have 


n 


k=1 


IV 


1 
nN 


Combining the last two inequalities, (1) results. 
When a=0, (1) reduces to the well known relation among the har- 
monic, geometric and arithmetic means of n positive numbers. Here 


equality occurs if and only if 2:= --- =z,. 
If equality occurs in (1) it also occurs in (3) and (4). By (4), | z,| 
| . By (3), cos 0, =cosa/2; hence 0, = +a/2 (k=1,---,n). 


By (2), >-2_; sin 6,=0. Therefore if a>0, m must be even, and n/2 of 
the numbers equal re~‘¢/?, while the other 2/2 numbers equal re‘@!?. 
Conversely, when these conditions are satisfied, equality is attained 
in (1). 

If the numbers are in the sector ¢Sam zSam (¢+a), we apply 
the transformation 


= 
which rotates this sector into the sector —a/2 <am zSa/2 without 
affecting the value of any member of (1). 


THEOREM. If, for each n=no, the roots of the partial sum of degree n 
of the formal power series f(z) =). °_94,2" lie in some sector with vertex 
at the origin and aperture a<r,' then f(z) is an entire function of order 0. 


The case in which f(z) is a polynomial is trivial and is excluded from 


5 While a is independent of , we do not require that there shall be one sector which 
contains the roots of all the partial sums of degree m = 1p; the lines bounding the sector 
may be different for different values of n. 


Pope 
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consideration. We shall suppose a9#0; otherwise a power of z could 
be removed from f(z) without affecting the theorem. Let 


f,(z) = a,z*, n => No. 
k=0 
By the Gauss-Lucas theorem the roots of f, (z) are also in the sector 
which contains the roots of f,(z), and the only roots of f,/ (z) that lie 
on the boundary of the sector are multiple roots of f,(z); hence 
f. (0) #0. Repeated applications of this argument yield the result that 
(k=0, 1,---). 


According to the lemma, if 2, - - - , z, denote the zeros of f,(z), 
| do | | do 1/n 1 | | 
(5) at <—| |, = cosa/2;n = 
lai} mc| a, | 


From the first two members of this inequality it follows that 
|a,| "> with n. Therefore f(z) is an entire function. If p is its 
order, 


(6) — = lim inf 


1 1 1 1 
— log ; log 
n | n—1 | 
(7) 
> — —— log | ao| + log nc. 
n(n — 1) | n—1 
Let m=max (no, 4), n>m. Substituting n=m, m+1,---, min (7), 
and adding, we obtain 
1 1 1 1 1 
— log — 2 log ; 
n m—1 | | — 1) 
n lo og s 
sum S s=m 
Now 
log (s — 1 1 1 
| s—1 2 2 


; > log m — log (m — 1). 


| 
1 log | an 
p ne n log n 
From the last two members of (5) we have 


1941] POWER SERIES 163 


Consequently 
1 1 1 

(8) — log —— > A + — log? n + log c log n, 
n an | 2 


where A is bounded as n—>«. Comparing (6) and (8), we conclude 
that p=0. 


HwunNTER COLLEGE OF THE CITY OF NEW YORK 


ON A RESULT OF HUA FOR CUBIC POLYNOMIALS! 
ALVIN SUGAR 
In a paper by L. K. Hua,? we find the following principal result. 


THEOREM. For any positive integer €, every integer can be expressed 
in infinitely many ways as a sum of seven values of the cubic function 


f(x) = (x? — x)/6+ x 


for integral values of x: also, every integer can be expressed in infinitely 
many ways as a sum of seven values of 


F(x) = (x? — x)/6 
for integral values of x. 


In this paper we get a better result by applying a known identity 
for cubes to generalizations of the above polynomials. We state our 
results in the following two theorems. Note that, in Theorem 1, € may 
be positive or negative, or, for that matter, zero. 

Unless otherwise stated all letters in this paper stand for integers, 
positive, negative, or zero. 


THEOREM 1. For any ¢, c, and any k prime to e€, every integer can be 
expressed in infinitely many ways as a sum of five values of the function 


p(x) = ex? — x)/6 + ka +0 
for integral values of x. 


THEOREM 2. For any k and c, every integer can be expressed as a sum 
of four values of the function 


P(x) = — x)/6+ kx +e 
for integral values of x. 


Theorem 1 is trivially true when e=0. For, in this case, (€, k) =1 
implies k = 1. Henceforth we take e+0. 


1 Presented to the Society, April 27, 1940. 

2 On the representation of integers by the sums of seven cubic functions, Téhoku 
Mathematical Journal, vol. 41 (1935-1936), pp. 361-366. 

3 Hua fails to mention in his formulation of this theorem whether his e may take 
negative values. It seems that Hua implicitly assumed e positive, as was noted by Pall 
in his review of the Hua paper in the Zentralblatt fiir Mathematik, vol. 14, p. 10. 
(This assumption was probably unnecessary, however.) 
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Let us first prove Theorem 1 for Hua’s polynomial f(x). Since x? —x 
is divisible by 6, it is evident that 


(1) — 2) + == (mod | ¢|), 


for any integer x. Hence it is evident that there exist | «| incongruent 
values modulo | e| of f. Consequently for any integer 7 it is true that 
there exist integers ¢ and m such that 


(2) n = f(t) + me. 


We can show that me is a sum of four values of f by employing the 
following identity: 


(3) f(m + 1) + f(m — 1) + 2f(— m) = me. 
From (2) and (3), we get 
(4) n = f(t) + f(m + 1) + f(m — 1) + 2f(— m), 


which tells us that every integer is a sum of five values of f(x), x in- 
tegral. Since we can replace ¢ in (2) by any number congruent to it 
modulo |e|, it follows that the representation (4) can be made in 
infinitely many ways. 

With but slight modification of this proof, we can obtain the 
more general result stated in Theorem 1. In this case (1) becomes 
p(x) =kx+c (mod | e| ). And, since k is prime to e, kx-+c ranges over 
a complete set of residues modulo | | when x does. Now if we replace f 
by p in (3), the right member of this identity becomes me+4c. From 
these facts we can deduce Theorem 1. 

If we replace f by P in (3), the right member of our identity be- 
comes m-+4c, which ranges over all of the integers with m; hence we 
have proved Theorem 2. It is evident from Theorem 1, with e=1, that 
we could duplicate Hua’s “infinitely many ways” in our Theorem 2 
if we increased the number of summands to five. 


Cooper UNION INSTITUTE OF TECHNOLOGY 
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DIVISORS OF ZERO IN MATRIC RINGS 
NEAL H. MCCOY 


1. Introduction. An element a of a ring S is a divisor of zero in S if 
there exists a nonzero element x of S such that ax=0, or a nonzero 
element y of S such that ya=0. The purpose of the present note is 
to obtain several theorems about divisors of zero in matric rings 
which, although quite elementary in character, have apparently not 
been previously noted. Throughout, unless otherwise stated, R will 
be used to denote an arbitrary commutative ring with unit element. 
Let R, denote the ring of all matrices of order m with elements in R, 
and R[A] the ring of polynomials in the indeterminate \ with co- 
efficients in R. If A is an element of R,, we shall denote by 
f(\)=|A—A| the characteristic polynomial of A, and thus f(A) is 
an element of R[A], with leading coefficient 1. The ideal m of all ele- 
ments g(A) such that g(A) =0 is the minimum ideal of A. If the minors 


of X—A of order n—1 are denoted by hj;(A) (7, 7=1, 2,---, m), it 
has been shown in a previous paper! that g(A) =0 (m), if and only if 
hi ;(A) = 0 (f()), 4,3 1, 


If R[A] denotes the subring of R, generated by A together with 
the unit element of R,, which we identify with the unit element of R, 
then the elements of R[A | are the polynomials in A with coefficients 
in R. It is quite easy to show that A is a divisor of zero in R, if and 
only if | A| is a divisor of zero in R. But we shall show, in §2, that A 
is actually a divisor of zero in R[A ] if it is a divisor of zero in R,—a 
fact which is almost trivial if R is a field. This theorem is used in the 
following section in which we define, by means of the Sylvester de- 
terminant, the resultant R(f, g) of two elements f(A) and g(A) of R[A] 
and show, following Frobenius, that if f(A) has leading coefficient 
unity, 


R(f, g) =| 8(A)|, 


where A is any matrix having f(A) as characteristic polynomial. It 
then follows readily that an element g(A) of R[\] is prime to? m if and 


1 Neal H. McCoy, Concerning matrices with elements in a commutative ring, this 
Bulletin, vol. 45 (1939), pp. 280-284. Hereafter, this paper will be referred to as M. 

2 For definitions of this term, see W. Krull, [dealtheorie in Ringen ohne Endlich- 
keitsbedingung, Mathematische Annalen, vol. 101 (1929), pp. 729-744. This will be 
referred to later as K. A definition will also be found in §3 of the present paper. 
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only if it is prime to f(A), which in turn is true if and only if R(f,g) 
is not a divisor of zero in R. 

If A is such that m=(f(A)), we may say, following Sylvester, that A 
is not derogatory. Let us, as above, denote the minors of A—A of 
order n—1 by hi;(A), and set 


We conclude our remarks by showing that, if each ideal in R has a 
finite basis, A is not derogatory if and only if a contains an element 
of R which is not a divisor of zero in R. 


2. Divisors of zero in R[A ]. We shall now prove 
THEOREM 1. An element A of R, is a divisor of zero in R[A | if and 
only if | A| is a divisor of zero in R. 


The necessity follows easily by a familiar argument. Suppose that 
A is a divisor of zero in R[A] and hence that there exists a nonzero 
element X of R[A] such that AX =0. In other words, if A =(a;;), the 
following system of equations, 


n 
Dd = 0, 4=1,2,--- 
j=1 
has a solution #,---, in R with some Multiply these 
equations in order by the respective cofactors of dia, dea, °° * » Gna in 


|A|, and add. There results #,|A|=0, and hence |A| is a divisor 
of zero in R. 

We now pass to the second part of the proof. Accordingly, we as- 
sume that there is an element k#0 of R such that k| A | =0, and shall 
show that A is a divisor of zero in R[A]. Our method is to assume 
that A is not a divisor of zero in R[A] and obtain a contradiction. 

First, we prove the following 


Lema. If B and C are elements of R, and k is an element of R such 
that kB=kC, then k| B| =k| C}. 


This follows from the fact that kbi;=kc:i;, where B=(b;;) and 
C=(c;;), together with the following calculation. If b=b;,:bi,2 - - - Dian 
is any term, except possibly for sign, in the expansion of |B |, and 
Cizn the corresponding term in the expansion of | C|, 
then 


kb 
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and a continuation of this process shows finally that this is equal to kc. 
We now turn to the proof of Theorem 1. The matrix A satisfies its 
characteristic equation,*? and hence there is a relation of the form 


(1) A* + a,A™'+--- +0, =0, 


where a, = +|A|, and thus ka, =0. Hence, from (1), we get an equa- 
tion of the form 


kA (A 4 a,A*-? 4) = 0. 


But, by our assumption that A is not a divisor of zero in R[A ], it 
follows that 


(2) k(A nl 4 n—2 = 0, 
or, as we may write it, 
+ 4+ ++ + = — 


Now, applying the lemma, we see that 


n—2 n—3 nn 
kj A|-|A” +--+ + = 1) ann. 
But k| A | =0, and therefore ka®_, =0. Suppose kai_, ~0, but kat! =0 
(720). Then k’ =ka’,_, 0 has the property that k’a,=k’an_1=0, and 
hence from (2), 
k'A(A™2 + --- + a,-2) = 0. 

A repetition of this argument shows that there is an element k’’ #0 
of R, which is a multiple of k’, such that k’’a,=k’’dn1=hk’'an_-2=0. 
Continuing this process we finally get an element /#0 of R such that 
1A =0, which violates our assumption that A is not a divisor of zero 


in R[A |. The theorem is therefore established. 
Our method of proof also establishes the following 


Coro.iary. If A is a divisor of zero in R,, there exists an element 
g(A) of R[A | such that 


Ag(A) =0,  g(A) #0, 


where g(d) is an element of R[| with the property that, for some positive 
integer 1, 


Aig(h) = 
f(X) being the characteristic polynomial of A and k an element of R. 


3 See M, p. 282. 
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3. Resultants and their properties. We first make some preliminary 
remarks.‘ Let a be an ideal in the commutative ring R, with unit 
element. An element 6 of R is said to be prime to a if bx =0 (a) im- 
plies that x=0 (a), otherwise D is not prime to a. An element 6 is 
prime to the element a if b is prime to the principal ideal (a). If every 
element of an ideal 6 is not prime to a, but every proper divisor of b 
contains an element prime to a, then b is necessarily a prime ideal 
which we shall call a maximal prime ideal belonging to a. Krull has 
shown, by methods which are necessarily transfinite if R is unre- 
stricted, that every ideal has maximal prime ideal divisors and that 
the elements of R which are not prime to a are precisely the elements which 
are contained in some maximal prime ideal belonging to® a. 

We now consider the ring R[\], where ) is an indeterminate, and let 


= A* + +4, g(A) = bA™ + + --- + dy 


be two elements of R[A], the leading coefficient of f(A) being 1. In 
case R is a field, the vanishing of the resultant of f(A) and g(A) fur- 
nishes a necessary and sufficient condition that f(A) and g(A) be not 
prime to each other.* We shall now obtain an analogous result in case 
R is unrestricted. 

Let us denote the resultant of f(A) and g(A) by R(f, g) and define 
it in a purely formal way by the Sylvester determinant 


RUS, 8) 


the blank spaces consisting of zeros. 
The connection with the preceding part of the paper is indicated by 


pris2. 

5 In fact, it can be shown that if in R every ideal has a finite basis, the maximal 
prime ideals belonging to a are precisely the maximal among the prime ideals associ- 
ated with the primary ideals in a normalized decomposition of a into the intersection 
of primary ideals. All of these prime ideals are the prime ideals belonging to a. 

6 In case R is a field, this implies that f(A) and g(A) have a common factor of de- 
gree at least one. 
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THEOREM 2. If A is any element of R, which has characteristic poly- 
nomial f(d), then 


(3) R(f, g) =| |. 


This is an almost immediate consequence of Frobenius’ theorem for 
the case in which R is the field of complex numbers.’ For if we con- 
sider the elements of a matrix A, as well as the coefficients of g(X) to 
be independent complex variables, relation (3) is seen to be an iden- 
tity in these variables and thus remains true if these variables are re- 
placed by elements of® R. 

Now let A be any fixed element of R, with characteristic polyno- 
mial f(X). The ideal m of all elements h(A) of R[A] such that h(A) =0 
is the minimum ideal of A. It is now clear that an element g(A) of 
R[A|] is prime to m if and only if g(A) is not a divisor of zero in R[A], 
and Theorem 1 states that this is the case if and only if |g(A)| is 
not a divisor of zero in R. We shall now prove 


THEOREM 3. An element g(d) of Rd] is prime to m if and only if 
it is prime to f(r). 


By the preceding remarks, and Theorem 2, we see that g(A) is prime 
to m if and only if R(f, g) is not a divisor of zero in R. But clearly 
R(f, g) depends only on f(A) and g(A) and not on the particular choice 
of matrix A with characteristic polynomial f(A). Let A’ be a matrix 
with characteristic polynomial f(A), and such that its minimum ideal 
is? m’ =(f(A)). Then g(A) is prime to m’ if and only if R(f, g) is not a 
divisor of zero in R. But it was found above that this is precisely the 
condition that g(A) be prime to m, and the theorem is established. 

Since g(A) is prime to m if and only if R(f, g) is not a divisor of 
zero in R, we have incidentally proved the following result which is 
independent of the theory of matrices: 


THEOREM 4. If R is a commutative ring with unit element and X is 
an indeterminate, an element g(d) of R[d| is prime to the element f(n), 
with leading coefficient unity, if and only if R(f, g) 1s not a divisor of 
zero in R. 


4. A characterization of matrices which are not derogatory. Let 


7 G. Frobenius, Ueber lineare Substitutionen und bilineare Formen, Journal fiir die 
reine und angewandte Mathematik, vol. 84 (1878), p. 11. 

8 Cf. M, p. 281. 

® This will certainly be the case if A’ is the companion matrix of f(A). See relation 
(4) of §4. 
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A be an element of R,, with characteristic polynomial f(A) and 
minimum ideal m. Following Sylvester, we may say that A is not 
derogatory if m=(f(A)). Let hi;(A) denote the minors of \X—A of order 
n—1. It was remarked above that an element g(A) of R[A] isan 
element of m if and only if 


We shall now make use of this fact in a proof of the following theorem: 


THEOREM 5. If in R each ideal has a finite basis, the matrix A is 
not derogatory if and only if the ideal 


contains an element of R which is not a divisor of zero. 


Suppose 0 is an element of R which is not a divisor of zero and that 
b=0 (a). Then, from (4), it follows that if g(A) =0 (m), then bg(A) =0 
(f(A)). But f(A) has leading coefficient 1, and from this fact, and the 
fact that b is not a divisor of zero, an easy calculation shows that 
g(A) =0 (f(A)). Since always f(A) =0 (m), this shows that A is non- 
derogatory. 

Now let us assume that A is not derogatory which means that 
g(A)a=0 (f(A)) implies that g(A)=0 (f(A)). Under hypothesis of a 
finite basis for each ideal in R, this means that a is not divisible by 
any prime ideal belonging to’ (f(A)). This implies, in turn, that there 
is an element q(A) of a which is not in any prime ideal belonging to 
(f(A)) and is thus prime to" f(A). But since g(A) is prime to f(A), it 
follows by Theorem 4 that R(f, g) = | q(A)| is not a divisor of zero 
in R. Let |q(A)| =a. Extend R to a ring R’ consisting of elements of 
the form r/a‘ («=0, 1,---;7 in R).!2 In R’, a has an inverse and 
therefore g(A) has an inverse'* which is a polynomial in A with co- 
efficients in R’, say 


P(A) = (bo/ai*)A* + (b:/a*)A*! + --- + (0; /a*). 
Since g(A)p(A) =1, in R we have a relation of the form 


10 Cf, van der Waerden, Moderne Algebra, vol. 2, p. 41. 

11 In fact, it may be shown by induction on k that if every element of an ideal a 
is contained in some one of the prime ideals }i, Po, - - > , Px, then a is divisible by some 
one of the pj. 

12 See R. Holzer, Zur Theorie der primaren Ringe, Mathematische Annalen, vol. 
96 (1927), p. 722. 

13 Cf. M, p. 282. 


172 N. H. McCOY 


where 3 is not a divisor of zero. If we set 


we have in R[A] 


= 6 (m), 
which, by hypothesis, implies that 
= 5b (f(r). 


Now g(A) =0 (a) and clearly also f(A) =0 (a), from which it follows 
that b=0 (a). The theorem is therefore established. 


SmitH COLLEGE 


